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Effect of Water on the Induction Period of the 
Polymerization of Methyl Methacrylate’ 


I. C. Schoonover, G. M. Brauer, and W. T. Sweeney 


rhe effect tl 
zation of methy 
and 
markedly 
to the 


absence 


of water In monomer or 
methacrylate 


shortened Dy smal 


Was Inve 


inhibitor ovl peroxi 
quantities of 
onal centration and 
In the 


concentratior 


water col 
I nhibitor it is nversety 
effect 


water-containing n 


peroxide 


acceleratior 
enzovl 
radical 


nonomeyr 


sufficient t 
the reac 


concentratio which Is 


does not alter appreciably 


l. Introduction 


ect of impurities in the monomer on the 
the induction period and the kinetics of 
wlymerization has been described by Bart- 
Altschul [1],' Cohen [2], Ford [3], Kolthoff 
\ 1, 5] Melville and Watson 6], Price 

7}, and others. The effect of water on 
erization of methy! methacrylate has not 
rted in the literature. Since experiments 
iboratory indicated that cast polymer made 
hv! methacrylate containing water is more 

to crazing, the present work was under- 
study the effect of water on the bulk poly- 
1of1 ethyl methacrylate. 


2. Materials 


ercial monomer (Rohm & Haas) was puri- 


100-mm pressure In 
iss helices. The purified monomer was 
a refrigerator and was again fractionated 
ely before This distillate had a re- 
ndex n> of 1.4120 and a density d? of 
Further fractionation did not change these 
Tests showed that the distilled 
contained less than 0.002 percent of free 
s than 0.06 percent of water [8], and no 
9] or hydroquinone [10] 


use 


man Kodak reagent grades of benzoyl peroxide 


» 


nately 97 percent pure), hydroquinone, and 
-butyrate, which was redistilled (bp 101.1 
(), were used without further purification. 


3. Procedure 
3.1. Flow-Time Measurements 


ren was bubbled through the purified mono- 
10 to 15 min, and benzoyl peroxide and 
were added After dividing the 
ito two parts, a measured amount of water 


none 


efore the XIIth International 
w York, September 1951 
sackets indicate the literature references 


yngress of Pure and Applied 
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le cat 


lirectly 


j 


aly 
water 


proportior i 


proportional to 


aused bv the 


inomer, with a 


shorten tl 


rate alte 

Ten milliliters of each 
pipetted into calibrated 
The viscometers were 
constant-temperature bath main- 
tained at 71.1 0.1° C (160° F), and the time was 
noted This time taken the start of the 
reaction. The viscometer openings were covered 
with tinfoil, and no effort was made to exclude OXYV- 
ren {ll viscometers were of the same size and had 
approximately the same air space above the liquid 


The flow 
frequent intervals 


was added 
of the 
Ostwald-Fenske viscometers 
then placed in a 


to one por tion 


Lwo samples was 


Wis 


as 


time of the samples was measured at 
The initial flow time of the 
monomer was 6.9 sec (0.59 cps Measurements 
were continued until a flow time of 100 sec 
reached, which is equivalent to a polymer \ ield of 9 
percent. All runs were repeated at least 
The reaction times for 100-sec flow usually agreed 
within 10 percent. Uninhibited samples with cata- 
lyst concentrations 0.01 percent showed 
larger variations, probably due to the presence of 
monomer peroxide 


was 


once 


below 


No effort was made in this investigation to deter- 
mine the exact length of the induction period, which 
may be defined as the time required to establish a 
steady-state concentration (equilibrium) of free 
radicals. Reaction times for 100-see flow are used 
in the subsequent of the induction 
period This corresponds to a relative flow time 
defined as flow time of sample per flow time mono- 
mer) of 14.5. It evident from the flow time 
versus reaction time curve that the induction period 
ends at an earlier reaction stage, probably in the 
neighborhood of flow Inspection of the 
data obtained showed that the reference pomt 
adopted did give the same quantitative relationship 
between water, catalyst, and inhibitor concentration 
and reaction time as earlier flow times and vielded 
more reproducible results 


discussions 
Is 


] 5 sec 


3.2. Benzoyl Peroxide Decomposition Measurements 


Decomposition of benzoyl peroxide in methyl 
methacrylate and methyl n-butyrate was checked by 





both colorimetric and volumetric methods 


359 





For colorimetric determinations, volumetric flasks 3 tffect water on the induction peri 


i of methyl methacrylate 


containing solutions of freshly distilled monomer and 
varving amounts of benzoyl peroxide were placed In atalyst entration 0.02 ¢ 
a constant-temperature bath kept at 71.1° +0.1° C 
160° F Periodically, 10-ml samples were with- 
drawn, and the peroxide content was analyzed 
colorimetri ally by the n ethod suggested by Barnes 
Q) 
To make volumetric determinations the benzoyl 
peroxide was dissolved in methyl n-butyrate (bp 
101.1° to 101.8° C), and the solutions were stored in a 
constant-temperature bath kept at 71.1 mae” & 
Periodically, 2-ml samples were removed and ana- 
lyzed by the method of Swain Stockmeyer, and necessarv to obtain a product of 100-s 
Clarke [11] creases linearly with the quantity of wat 
For decomposition measurements conducted in a | Table 2 lists the reaction time necessary 
nitrogen atmosphere the solution was placed in | given flow time for the polymerization cat 
glass reaction vessels fitted with covers containing a | varying concentrations of benzovl pero 
gas-inlet tube, exit stopcock, and self-sealing stopper. | water Typical curves for reaction time 
Nitrogen was bubbled through the liquid for 20 min time for anhydrous and water-containing s: 
and the stopcocks were closed. Periodically, 2-ml | shown in figure 1. In figure 2 the logarit! 
samples were removed by means of a syringe and relative flow time is plotted versus the react 
analyzed for benzoy! peroxide Nitrogen was flushed The nearly identical slope of the logarit] 
through the apparatus at approximately 6-hr inter- | relative flow time versus reaction time curves |] 
vals. The decomposition rate was followed until at | 9 relative flow time of 4 indicates that the pres 
least 50 percent of the benzoyl peroxide had | water affects only the length of the induction 
decomposed and not the subsequent polymerization rat 
3.3. Infrared Measurements time necessary to obtain a 100-sec-flow prod 
anhydrous and water-containing monome! 
versely proportional to the square root of the ec: 
concentration (fig. 3). Thus the kinetics follo 
usual course for the catalyzed addition poly: 
enZ0y pe roxide os tion 12] The effect of water decreases wit 
creasing catalyst concentration and becomes 1 


Infrared-absorption curves were obtained with a 
Perkin-Elmer double he am ree ording spec trophotom- 
eter to chee k for the possible presence of av hvdrated 


4. Results and Discussion ble at concentrations greater than 1 g/liter 


The effect of water on the induction period of the 
methyl methacrylate polymerization is shown in = TT 
table 1. Within experimental error, the time 15 
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in many 
me inhibitor is not 
onducted with varving concentrations of 
Vater, and cataly St The effect of hydro- 
n the length of the induction period of 
and water-containing monomer is shown 
In figure 4 is shown the linear increase of 
on period with increasing inhibitor con- 
This proportionality of induction period 
mumone probably indicates a stoichiometri 
p between inhibitor and free radicals 
sence of hydroquinone * only influences 
of the induction period, but also acts as a 
fic. 5 Even after completion of the 
eriod, the rate of polymerization is con- 
ess than in the of inhibitor This 
m increases as the initial concentration of 
Similar results were obtained 
none inhibited polymerization [13]. This 
iv be due to the fact that (1) the poly- 
n begins while hydroquinone is still 
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Effect of water and catalyst on methe 
4 
methacrylate polymerization 


n, g/100 ml of monomer; ne, 100 seconds; temper 


present and 2 the inhib tion 
inert toward radicals, that is 
the poly merization 

The leneth of the induction period Is 
proportional to the percentage 
the monomer (fig. 6 In the 
0.5 g/liter benzoyl peroxide 
quinone a 100-sec flow is reached after 700 min. A 
similarly prepared sample containing 1 percent of 
water reaches this flow within 370 min. 

The effect of water on the induction period de- 
creases rapidly with increasing catalyst and decreas 
ing inhibitor content (see figs. 4 and 7 KE xtrapola- 
tion to zero hydroquinone concentration gives for 
anhydrous and | percent water-contamimg monomer 
to which 2.5 g/liter catalyst has been added a reaction 
time of 31 min for the formation of 100-sec-flow 
polymer. This is in excellent agreement with ex- 
trapolated values from curves of the reaction time 
versus catalyst 


products are not 
thev in turn influence 


inversely 
initially in 
polymerization with 
and 0.06 g/liter hydro- 


ot wate! 
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Figure 8 shows the polymerization rate curves for 
hydroquinone-inhibited monomer containing 0.5 and 
7.5 g of benzoyl peroxide per liter of monomer. With 
inhibitor no simple relationship was found between 
peroxide content and induction period covering the 
entire catalyst concentration range investigated. At 
concentrations above 1 g/liter the 100-sec-flow value 
is inversely proportional to the initial catalyst con- 
centration (dg. 7), that is, the rate determining step 
during the induction period is probably second order 
with respect to catalyst, although the kinetics are 
complicated by the fact that the presence of hydro- 
quinone or its conversion products accelerate the 
peroxide decomposition [2]. At catalyst concentra- 
tions below 1 g/liter, where the catalyst-inhibitor 
ratio is small, the effect of hydroquinone becomes so 
pronounced that it alters the kinetics of the reaction. 

Measurements of the effect of water on the induc- 
tion period in the absence of catalyst (thermal poly- 
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merization) are within the limits of 
error. The time in which 100-sec-flow produc 
were attained varied from 254 to 265 min for ani 
drous and water-containing samples. The ind 
period and rate of the thermal polymerizaty 
influenced to a large extent by traces of impurit 
Air initially present retards the polymerizat 
anhydrous and water-containing samples. 
percent increase in the length of the inductior 
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experiunenhia 


derable decrease in the polymerization 
bserved due to preferential reactivity of 
d monomer with oxygen 

the induction period of the thermal 
on remains constant in the presence of 
»bserved change of the induction period 
vzed reaction must be due to an increase 
of peroxide in aqueous 
v catalyst concentrations. Hence, more 
that will act as polymerization initiators 


mposition rate 


ied in the presence of water 
further the effect of water, the peroxide 
on rate measured by comparing 
allv the peroxide concentration of mon- 
had been kept at 71.1° C for various 
A slight but noticeable increase in 
observed in 


Because 


was 


TRATION, rhe 


rate monomer 
of at this 
mperature decomposition measurements 
are difficult to conduct due to the small 
catalyst that be used to obtain an 
induction period more accurate dete 
vith 0.05 V7 benzov!l peroxide in methyl 
a prototype of methyl methacrylate that 
wolymerize) were conducted. An increase 
mately 30 percent in the reaction rate 
for water-containing solutions was observed. 
lecomposition rates were determined in a 
the was not ap- 
1 percent of water. Thus 
necessary for the acceleration of the 
decomposition by water First-order ki 
icative of a free radical mechanism were 
up 50-percent decomposition. The 
rate constants for a reaction time up to 20 
proximately 50-percent decomposition) calcu- 
the method of least squares amounts to 
hr~' and 0.0378 hr~' for anhydrous and water- 
ng solutions, respectively. This difference 
constants is not considered significant. Large 
from these rate constants in the later 
ges of the decomposition are due to (1) the effect 
the presence of reaction products formed, (2) 
losses due to evaporation, and (3) possible 
ence of traces of oxygen in the solution 
length of the induction period with the more 
2-azo-bis-isobutyronitrile catalyst also 
tened with water-containing monomer 


position 
| 


Was 


percent wate! 


must 


itmosphere reaction 
accelerated by 


to 


rele 


\ 


tions 


he 


is 


\ddition of D,O to monomer results in an induction 
«| approximately equal to that of monomer con- 
the same percentage of H,O. Thus, within 
mits of the experimental error (10 percent), no 
effect is observed Water seems to speed 
catalyst decomposition only and apparently 
not affect the chain propagation and termination 


he 


thas been reported by a number of investigators 
tnot only the rate but also the stoichiometry of 
mposition of benzoyl peroxide varies from 
0 solvent as well as with the peroxide con- 
n initially present [14,17]. Because highest 
nposition rates have been observed in highly 
associated solvents, such as phenols, amines, 


} 
dece 


ent 


alcohols, and acids [11, 14, 15, 16], an increase in the 


presence of water should be exper ted 
If the decomposition proceeds by the mechanism 
pre viously postulated 16, 17 


C,H,COO 2C,.H.COO 


> 
\ 


> 


C.H.COO-+ RH <.H.COOH+R 


C,H.COO C.H.COO R+C,H,COO 


The results suggest that the benzoyl radical favors 
capture of the hydrogen of the polar water molecul 
instead of of covalent C—H_ bonds The 
OH radical formed will react with benzoyl radicals 
to perbenzoic acid instead of a benzoi 
ester which formed if C—H bond 
This peracid readily decomposes into free radicals 
that will also speed up the peroxide decomposition 
Hence, the more readily the benzovl radical 
react with RH, the greater the speed of reaction 

From the detailed studies of the kinetics of th 
methyl methacrylate polymerization, Schulz and 
coworkers [18, 19] assumed that catalyst and mono- 


fission 
rive acid 
( le aN ed 


is is 


can 


mer exist in equilibrium with a complex that re- 
arranges to give the first radical unit in the growing 
chain, but no independent evidence of this complex 
was reported. The increased rate of decomposition 
of catalyst observed in mixtures of benzene and vinyl 
acetate as compared with the rate in benzene alone 
[20], as well as the changing stoichiometry of the 
decomposition of finite concentrations of benzoyl 


peroxide in benzene [2], has also been ascribed to the 
formation of some intermediate complex, possibly 
(C,H;COO),-C,H,, in a preliminary, rapidly reversi- 


ble step These data, as well as the present results, 
suggest that a hydrated peroxide complex formation 
takes place in the presence of water, with the sub- 
sequent rapid decomposition of this thermally un- 
stable intermediate. However, polymerization of 
samples of monomer containing equivalent amounts 
(based on peroxide) of reagent grade and hydrated 
benzoyl peroxide (approximately 12 percent of water) 
show that such a hydrate does not appreciably 
decrease the length of the induction period Fur- 
thermore, the absence of hydrated peroxide com- 
plexes at room temperature was established from a 
series of infrared-absorption measurements of 1.5- 
percent solutions of benzoyl peroxide in chloroform 
containing 0.15 percent of water 

The more rapid decomposition can also be ex- 
plained in terms of the ‘“‘cage effect” as postulated by 
Matheson [21]; that two fragments from the 
dissociated molecule find themselves held in a cage 
of solvent molecules, and therefore the radical pairs 
collide many times with each other and with the 
surrounding molecules before they can diffuse apart 
The water may presumably permit diffusion to occur 
On the other hand, the water may be 


of the 


1S, 


more rapidly 
effective in reducing the activation energy 
reaction between radical and monomer 
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5. Summary 


1. The length of the induction period of the benzov! 
peroxide catalyzed polymerization of methyl methac- 
rviate is markedly shortened at low catalyst con- 
centrations by the presence of water in the monomer. 
2. The length of the induction pe riod is inverse ly 
proportional to the water concentration and directly 
proportional to the hvdroqu none concentration in 
the monomer In the absence of inhibitor, it is 
inversely proportional to the square root of the 
catalyst concentration 

) Presence of water in the monome! does not 
appre iably alter the reaction rate after the induction 
period 

4. As a slight increase in free radical concentra- 
tion will shorten the induction period, the more rapid 
decomposition of benzoyl peroxide in water-con- 
taining monomer causes these effects. Possible 
mechanisms of this effect have been suggested 
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Transmittancy of Commercial Sugar Liquors: 
Dependence on Concentration of Total Solids 


V. R. Deitz, N. L. Pennington, and H. L. Hoffman, Jr. 


l. Introduction 


n flux by absorption from a beam of 
rgvy passing through a solution has long 
for the determination of dissolved 

The scattering of radiant energy is known 
with the optical measurements, but since 
of the interference 


sub- 


is not well understood. 
rally been easier to minimize the scattering 
rect for it. There are many 
ily important solutions, sugar liquors for 
which both absorption and scattering are 
In order to characterize the optical 

of such influence of the 
tion of total solids upon the spectral trans- 


howe er. 


solutions the 


of these solutions has been studied 
meentration of the solids in 
il sugar liquors is small compared to that 
al solids and, furthermore, is not subject to 
enient direct measurement. In addition 
se and water, the liquor contains dissolved 
spended materials of variable 
absorb and 


nonsucrose 


composition 
light to different 
procedure for 
concentration of 
al sugar liquors by the concentration of 
ds, realizing always that the 
npose only a very small fraction of the total 
this may be different for each commercial 
Thus, it present to 
transmittancy measurement corresponding 
tive concentration 


hat scatter 


Therefore, it is common 


} ibe the 


hnologists to dese 


nonsucrose 


is only possible at 
of nonsucrose solids, it 
ng tacitly assumed that the light-absorbing 
scattering materials are diluted in propor- 
total solids. 
ild be noted that all transmission measure- 
ported in this paper were made on a model 
kman Spectrophotome tel Because of the 
of light-scattering materials, the values of 
tancy will be a function of the geometrical 
ent of the source, test medium, and the 
This arrangement varies in instruments 


of different manufacture 
also be obtained be CAUSE 


Different values may 
of variations in the relative 
amounts of absorption and scattering material that 
may be present in particular sugar 
liquors from different sources 


commere ial 


2. Definition of Optical Terms 


The opti al nomenclature used in this paper Is that 
given by K. S 1949 [3 Absorbancy is 
defined as 


Gibson in 


r ; 
oOo” a - ro 
log ] 


where 7, and i. are the transmittance of 
solution and solvent, respectively, and 7, is the 
transmittancy. Transmittance is defined as the 
ratio of the transmitted flux to the incident flux of 
the spectral energy being used. The definition of 
differs from that previously used by 
Bates and Associates [4] in 1942, where absorbancy 
was defined as |1 T 7, being the transmittancy of 
the solution 
The absorbancy ince X 1s dk fined as 


the 


absorbancy 


which is equivalent to the specie absorptive index 
used by Bates and Associates and denoted by them 
as log t In eq 2). bis the cell length, and ¢ is the 
concentration of the solute or constituent of interest 
When applied to commercial sugars, ¢ 1S the concen- 
tration of total per milliliter of 
solution Total comprise the and 
nonsugars, including the colored material 

The treatment of Bates or of 
correct only when no radiant scattered 
from the initial direction of propagation. Equation 
2) is a formulation of the Lambert-Beer law [5] and 
ias long been used by sugar technologists to deter- 
mine the value of the index, a,, from 
observations on “optically clear’ solutions. 
The term “optically clear’ has been used to indicate 
that the solutions did not appre iably scatter light 
It is now recognized that the sucrose molecule itself 


solids in grams 


solids sugars 
Gibson is strictly 
energy 1s 


abso ban \ 


sugal 
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scatters light to a measurable extent [6]. The 
scattering in intensively purified sucrose solutions is 
small and is of the magnitude ascribed to molecular 
scattering For these solutions the treatment of 
Bates or Gibson is valid, but it is not valid for the 
general case of commercial sugar liquors. The 
absorbancy index is constant in general only for non- 
scattering solutions. The present work shows that 
this is not the case for commercial sugal liquors 


In order to avoid ambiguity, the term“ absorbance, 
index” will not be used when both absorption and 


scattering decrease the intensity of light Instead 
the term “‘attenuation index’’ will be used. The 
attenuation index is an unknown function of absorp- 
tion and scattering and cannot be readilv analyzed 
into its components: (1) the molecular absorption 
due to dissolved substances, (2) the absorption by 
the suspended particles ) the s« attering by the 
suspended particles, and (4) the scattering by the 
molecules of the solution (negligibly small in the 
case of sucrose solutions The attenuation index, 
a; howev er, can by defined in terms of the observed 
transmittancy, the cell length and the concentration 
of total sugar solids as follows 


In the absence of scattering, the attenuation index 
is equal to the absorbancy index (a,), which is inde- 
pendent of the concentration but dependent on 
wavelength, while in the absence of absorption it is 
a measure of the scattering and dependent on con- 
centration and w avelength In the presence of both 
absorption and scattering, the ratio defines a value 
for attenuation index at any given concentration 
The attenuation index depends upon the cell length 
and the position of the cell insofar as they influence 
the fraction of total transmitted energy intercepted 
by the phototube in the measuring instrument. 


3. Correction for the Presence of Sucrose 
and Water 


In order to obtain a* due to the impurities of a 
commercial sugar solution, it is necessary to correct 
the observed transmittancy for the influence of 
sucrose and water. It is convenient to obtain the 
transmittancy of the solutions by using water in the 
reference cell of the spectrophotometer 

Sucrose solutions were intensively purified with 
adsorbents and the transmittancies determined. 
The purified sucrose solution was prepared as follows: 

A sample of commercial granulated sugar was 
dissolved at 80° C in distilled water to 60.4 Brix ‘ 
(0.778 g/ml). The solution was treated four succes- 
sive times with a washed activated carbon (15 
g/liter of solution), stirred for 30 min at 80° C, and 
filtered after each of the four treatments through 
an asbestos pad formed on No. 40 Whatman paper. 
After the last carbon treatment, the solution was 
filtered through a “‘fine” sintered glass filter. The 
pH was always kept between 6.8 and 7.2. 


‘ “Brix” is defined as the percentage t weight 
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In figure 1, a3 for a sucrose solution (0.778 
is plotted against the wavelength The cel 
was 10cm. It may be observed that a? is less 
zero for wavelengths from 500 to 900 mug 
is due to the fact that that reference cell cont 
more water than the sucrose solution Th 
observed at 740 my and 860 my coincid 
harmonics in the infrared absorption spe 
water [7] 

The correction for sucrose and water has 
applied at each wavelength to the data obta 
for commercial sugar liquors. The correction 
cell lengths of 1 cm or less has been found t 
smaller than the uncertainties of the measurem 
in the visible spectrum. For very accurate wor 
in cell lengths greater than 10 em, it is rec 
mended that the correction for sucrose and wat 
be determined in the instrument available for 
work and at the particular concentrations and 
lengths desired. 


4. Experimental Results 


The sugar products investigated include Cuba 
and Hawaiian raws, washed Cuban raw, granulated 
sugars, invert mixtures, impure beet sugar, and 
blackstrap. The transmittancies were measured 
wavelengths between 400 and 820 my with a Beek: 
man model DU spectrophotometer. The cell lengt! 
was chosen so that the transmittancy readings wer 
made in the range of 20 to 95 percent. These soli 
tions were prepared by adding distilled water # 
room temperature to the raw sugar and warming t! 
mixture with stirring to 80° C. After stramm 
through a 60-mesh sieve, the cooled mixture wi 
adjusted to a pH between 6.8 and 7.2 with a fer 
drops of a dilute NaOH solution. The concentr- 
tions less than the maximum (about 0.75 g/ml) wer 
obtained by dilution with water. The cell depth wa 
0.20 em. When the data are plotted in the conve! 


ip 


tional manner, that is, —log 7, against c, the de 
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CONCENTRATION OF TOTAL SOLIOS, g/ml 
Relation between limiting slope of Beer's law and 
the interce pional a: versus c plot 


an unfiltered Hawaiian raw sugar liquor at 560 my 


» from Beer’s law is quite evident. Figure 2 is 


ich a plot of the data for the Hawaiian raw sugar. 


It has been found that an accurate linear depend- 
m concentration is given by a plot of 1/a? 
It is readily seen that the value for the 

ting slope on the Beer’s law plot shown at the 


p of figure 2 is equal to aj, the reciprocal of the 


tercept. If af denotes the attenuation index at 


value as the concentration 
(infinite dilution), it 


and a; the 
zeTo 


ap- 
Is possible to 


efine the straight line by the following relationship: 


ei mek eich 
a* os & 


ype is numerically 
intercept is (l/az 
as can be more 
il extrapolation. 

al data are contained in figure 3 for the 
ttances of unfiltered Hawaiian raw sugar 
s plotted according to eq (4). It is significant 
slope increased with increase in wavelength, 
reason for this is not evident. The increase 
may depend upon the ratio of the absorption 


equal to [(1/at)—(l/ay)] 


However, the values of 
readily obtained from a 
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impure beet 


FiGguRe 4 Linear plots for several raw sugars, 

sugar, and blackstrap observed at a wavelength of 560 mu 
to scattering at the various wavelengths. The slope 
has not been observed in any case to be negative. 

The relationship given by eq (4) has been found 
to be valid for a large variety of sugar products 
Typical data for some of these at 560 muy are given in 
figure 4. The washed Cuban raw was observed in 
a 1.00-cm cell, the washed raw beet, Hawaiian raw, 
Cuban caw in 0.2-cm cells, and the blackstrap in a 
0.041-cm cell 

The data for two unfiltered granulated sugar li- 
quors are presented in figure 5. In these solutions 
the value of the attenuation index is quite small. 
Because of the relatively small quantity of absorbing 
nonsucrose solids, the small residual scattering in a 
granulated sugar solution contributes materially to 
the observed attenuation index. correction 
for scattering is necessary if a measure of the absorp- 
tion alone is desired 

An obvious method of modifying the turbidity of 
sugar liquors is by high-speed centrifuging. A Cuban 
raw liquor was subjeeted to this treatment, and the 
results are of some interest. A gravitational field of 
22,000 times gravity was obtained at an operating 
speed of 15,000 rpm, 100,000 times gravity ata speed 
of 40,000 rpm, and 150,000 times gravity at a speed 
of 50,000 rpm. All runs were made at substantially 


some 


room temperature by means of special refrigeration 


of the equipment 
|\1/a*] versus c 


Figure 6 is a plot of the resultant 
After the centrifuging operation at 


search Section, National 


ted to G. H. Hogeboom of the er Re 
h, for { e equipment for attaining 


the use 


nd 150,000 times gravity 


the ultracentrif 
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Ficure 5 


the highest gravity, a solid phase was observed at 
the surface of the liquor as well as at the bottom, 
which indicated a range of density of the suspended 
materials 

It may be seen from table 1 that the result of cen- 
trifuging the 60 Brix liquor at 22,000 times gravity 
was to decrease the attenuation indices af and af*. 
Apparently, the conditions of this experiment re- 
sulted in about the Same percentage decrease in both 
ay and aj. As the strengih of the gravitational 
field increased, the values of a} and af approached 
each other, indicating that the slope of the straight 
lines approach zero with increase in centrifuging 
The values of af and af are influenced by time of 
centrifuging, gravity, and concentration of solution 
(which influences the viscosity of the solution) as is 
shown in table 1. 
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the re 
! 


on ¢ might be a general phenomenon. Figu 
contains results for suspensions of a carbon bi: 
titanium dioxide, a filter aid, and char dust 


srix granulated sugar solution. 


The cell let y 


indicated and the transmittances were measure 
The suspensions were prepared by add 


560 mu. 


each of the following materials to 30-g portions 
60 Brix liquor: carbon black (0.1mg), titanium ios 
(0.33 mg), filter aid (1.0 mg 
The dependence of 1/a* on ¢ was found to be 

and this was also true for the same systems obser 
Similar results were obta 


at other wavelengths. 
um dioxide 
glycerine and for powdered mica suspended in 
Calcium oxalate suspensions prepared by pri 


for 


tion 


titani 


were 


also 


found 


and 


to 


and ¢har di 


graphite 


follow this 


ist (87.01 


dispersed 


relat 
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sus that in applications to turbidimetric | and absorption in the visible range of wavelengths, 
is not necessary that the suspension be | but may contribute appreciably in ultraviolet regions 
diluted to obey Beer’s law. By using | Such impurities may significantly influence the re- 
of plotting herein proposed, the concen- | fining characteristics of the sugar liquor However. 
ge of suspended material can be greatly | the transmittancy measurements in the visible region 
turbidimetric analysis serve a definite purpose, and they have the added ad- 
vantage that the systems under study are not affected 
5. Remarks by the measurements 
ci pe ndence ot la Ipon ¢ supplenx nts 6. References 
eer’s law considerations in dilute solutions 
ney data on commercial sugar liquors can 
orted as attenuation indices at some refer- 
itration. It appears that either af or a? 
| for this purpos« 


ttancy values have iong been used to 
1.000 n 


e purifying effects of various adsorbents, 


e char, on commercial sugar liquors This . tee gy wont we BS Circular 440 
s observations on the sugar liquors before 

ie treatment with the adsorbent It is 

eciate that the information obtained from 

rements is not the complete story in re- 

adsorbent probl m There may be pres- 
vet significant impurities (for example, in- 

ts) that contribute little to the scattering WASHINGTON, September 19, 
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Extension of the First Spectrum of Rhodium (Rh1)' 
Robert J. Murphy 


p 


The are spectrum of rhodium has been investigated photographicall 
to 12000 A rhe spectrum was excited in conventional direct-current arcs and photograp 
on infrared sensitive emulsions with the aid of concave diffraction gratings Wavelen 
have been measured for 238 Rh lines, including 90 new lines, of which 53 have been els 
fied Two new evs evels have been added to the list of known term values: thes 


evels account for 18 of the 53 newly classified Rb1 lines 


1. Introduction measuring approximately 6 by 10 mm, were made by 
pressing rhodium-metal powder in a Dietert hydraulic 
lvsis of the Rhi spectrum has been ex- press These solid cylinders of rhodium were pinched 
» the long-wave limit of photographic | in slotted copper rods that served as electrode 
ensitivity, in order to provide additional | holders. The power was drawn from a 220 v d-c 
sof Rhifor volume II] of Bureau Circular | circuit, with currents of 6 to 8 amp 
wing compiled [1] All the spectrograms 1n this investigation were 
for studies of Zeeman effect, no investiga- | made with concave diffraction gratings set up in 
is spectrum have been made since 1925, | Wadsworth mountings, as described by Meggers and 
gers [2] published a paper on the platinum | Burns [8] 
That paper gave the wavele neths of 551 The range from 6300 to 8700 A was photographed 
nes between 4504 and 8615 A. In 1901 | with Eastman I-N plates, from 8000 to 10200 A with 
Snyder [3] published ‘The wave-number | Eastman I-Q plates, and from 97C6 to 12000 A with 
rhodium”, containing regularities among | Eastman I-Z plates. The I-Q and I-Z plates were 
nes The significance of these regularities hypersensitized according to the method described by 
ealized until 25 years later, when a partial | Burka [9]. This consisted of bathing the plates for 2 
tion of this system was given by Meggers | min in 2- to 6-percent solutions of 28-percent am- 
| monia, followed by a 2-min wash in alcohol and 
to 1940 the only analysis of the Zeeman | rapid drying with warm air 
f rhodium was that given by Sommer [5] in For the N-sensitivity region, a 15,000 lines/in 
He lists 136 atomic energy levels and classifies | grating was used. The first-order spectrum of this 
ately 1,000 lines, about 90 of which are | grating has a reciprocal dispersion of 4.85 A/mm 
nied by data on the Zeeman effect. In 1940 | The long-wave Z-sensitivity region was photographed 
ilysis by Molnar and Hitchcock [6] improved | in the first order of a 7500 lines/in. grating. In its 
nded the earlier work of Sommer [5]. They | first-order spectrum this spectrograph has a reeipro- 
velengths, relative intensities, and vacuum | cal dispersion of 10 A/mm. The Q-sensitivity region 
imbers (both observed and computed) for 446 | lying between the N- and Z-sensitivity regions, was 
from 1988 to 8615 A, in addition to 29 terms, | investigated with both gratings described above 
lues for 108 levels. The term values from Exposures varied from 20 min in the N region to 
per have been used in the present investiga- | 4 hr and 7 hr, respectively, in the Q and Z regions 
In the long-wave region, a dark-red-glass filter placed 
the extension of the description and analysis | in front of the spectrograph slit eliminated second 
| from 8615 to 11021 A is desirable and has been | and higher overlapping spectral orders and _ per- 
possible mainly by the use of the new photo- | mitted the first-order Rhi spectrum to be photo- 
emulsions now available for spectroscopic | graphed. For shorter wavelengths, a yellow-glass 
es. These emulsions are sensitive much farther | filter was used to give the first-order spectrum of 
the infrared than they were in 1925. This is | rhodium. 
strated by the thallium line at 13013 A recently For the spectral ranges covered by the I-Z and 
tographed [7] in the Bureau’s Spectroscopy | I-Q plates, iron wavelength standards adopted by 
oratory, which sets a new mark for long-wave | the International Astronomical Union [10] were 
phy of laboratory are sources. taken from the second- or third-order spectra and 
, , cony erted to first order. but for the range observed 
- Experimental Details with I-N plates the iron comparison spectrum was 
recorded in the same first order as the rhodium 
pectrum, Rh1, as presented in this paper was | spectrum 
by means of direct-current arcs between 3. Results 
s of metallic rhodium. These electrodes, 
sane Measurements of the line positions for wave 
was submitted to Boston College in partial fulfillment of the length determinations, and visual estimates of line 


the Master of Scie “ein N 1952 ° . . . . » Tr 
the Master of Science . ee in May 19 De einen intensities and character, were made on a Gaertner 





moving-carriage comparator. This instrument gives , intensities is from “1?", which indicates 

readings directly to | Mu Each Rht line retained in difficult to bisect, to “800” for the strong S 
the final list was observed and measured at least The final values of wavelengths, estimat 
three times, and many were remeasured several | intensities and line character. vacuum 

times to increase wavelencth aecuracy. The prob bers, and term combinations of classifie. 
able error for most wavelengths is approximately 
0.02 A Toward the long wavels neths, where most 
of the lines were recorded as hazy. which makes an 


given in table 1 For classified lines 
number calculated from the energy ley 
accurate comparator setting quite difficult, the error given The term notation is that used 

approaches 0.05 \ The rang of estimated relative and Hitcheock 6] 


1 
6332 
6342 
6363 
6376 
6386 


6402 
6410 
6413 
6414 
H41%8 


6431 944. 9] ( 6877 
6434 15537. 7 6877 
6445 : 56 3] 6879 
6452 ’ ! 5 | ; ) ' (aj HSOS 
6456. ! | 6905 


H461. | I } 0* ) Ol! 
6479 ; 242 3 ; 6919 
6485 L541 3 ; i 6925 
6496 6951 
6499, 52 2 ! 5 6965 


6511 . 7 ABS ; ‘ a- . ous 
6517. 5 1533' 3 5 l 6994 
6519. 67 5333. |‘ , O3 ) ‘ ‘ 7001 
6591 5165. 7 5 ‘ 0 : 7009 


6510. 35 15355. ' . 6972 


6618. 55 15104. & 93 7038 
6619 15102. 85 2 ) 214 7061 
6627 15083. 85 ; l 7101 
6630 , 15978. 55 b 2 7102 
6632 15074 3. | ; 7104 


6637 5 15062 
6643. 65 5 15047 
6645. 04 15044 
6658. 85 : 15013 
6659. 13 15012 


6662. 99 15004. 1: 3. 97 ; 7186. 95 13910. $ 
6672. 33 14983 ; — 7199. 5: 13885 
6673. 25 14981 *Gaug , 219 

6686. 63 14951 1 Garg 7P3 | 2% 

6705. 37 j 14909. ; 2 


) 
» 


6716. 68 . 14884. § “4—2°*G3, | 7240.; 

6721. 31 14873 : Sag — z * Pig 7256. 5: 
6725. 48 14864. 7: . as 7262. 62 
6730. 78 : 14853. 03 | ; 2 *Ghu 7268. 23 
6752. 34 14805 5 - z ‘Dig 7270. 85 
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S162 
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3000 
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2921. 5 l 9040 

2RSS Y 0041. 37 11057 


12866 QO58. 26 11036 
12861 Fi 2 QOR3. 47 11005 
12860 9102. 10 } 10983 
12830 ; ‘ 4124 10056 
12776 - ' 126. 13 10954 


10926. 12 
1LOSUS, OS 
10841. 50 
10829. 53 
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10719. 7 
10718. 1 


10678. 65 


l 
é 


10607. 68 
10572. 49 
10548, 34 
10528. 3 
10408. 14 
10483 


10477 


LO4St 

O580. 5 10434 
507 10416 
OOS 10407 
615 2 LOSUS 


O620. 75 10301 
0626. 53 2 LOSRD 
9632. £ 10378 
0671. 76 10336 


Although others have published data on Rhi 
extending to 8615 A, a comparison of the intensities 
of corresponding lines shows that the present esti- 
mates are on a more open scale that is approximately 
five times greater than that used in earlier publica- 
tions In addition, new lines have been discovered 
in the regions previously investigated. Of the 238 
new wavelength measurements here listed, 148 had 
previously been measured by Meggers [2], of which 
111 were included by Molnar and Hitchcock [6] 
With the aid of a grating ruled with 15,000 lines/in., 
it has been possible to increase the precision of some 
Rhi wavelengths. With the new photographic 
emulsions it has been possible to investigate this 
spectrum out to 12000 A, although there is only one 
Rh line beyond 11000 A 

One of the main difficulties encountered in the 
spectral observation of the platinum metals in open 
air ares is the presence of many atomic oxygen and 
nitrogen lines, and unidentified molecular lines, par- 
ticularly between 8300 and 11000 A. A compari- 
son with this Bureau’s wave length data, published and 
unpublished, shows that table 1 contains no lines 
originating from atmospheric constituents in the 
atomic state. Atmospheric molecular lines wer 
more troublesome, and it was a problem to deter 
mine which lines could be legitimately discarded as 
such. Fortunately, a photograph of the i idiun 
spectrum in the same region was available, and all 
lines appearing in both spectra were ascribed to 
atmospheric spectra and removed from the Rh1 list 
Several calcium, sodium, potassium, and copper 
lines were also present in the rhodium spectra, but 
the first three would be expected in almost any 
metallic sample. The copper lines were introduced 
when the are jumped to the copper electrode holder 


Wave number 


A m 
9713 10201 
9716. 97 5 10288 
9748 2 10254 
9757 l 10246 
O761 10241 
O7900. 7 2 10210. ' 
US46 10153 
OR50 ; 3 10148 
OSS86 Qh 10112 
9048. 3 10049 


g059. 6 , 10037 
10015. 4: 2 OO8] 
10082. 3 9915 
LOLLY 2 9870 
10121. 06 : 9876 


LOLS. ¢ OR12 
10499. 2° 9521 
LOSOU 9248 
11021.7 9070 


3.1 Classification of Observed Lines of Rh! 


With the term values as given by Molnar 
Hitchcock 6], a systematic subtraction betw 
even and odd levels with differences between § 
and 17000 cm~! was carried out. This serve 
classify 35 lines belonging to known terms. = L 
dition, a search for all the known term separat 
among three dozen of the stronger lines still ren 
ing unclassified has resulted in the discovery of t 
new even levels. With these two new levels 
at 23655.93 cm™', suspected to be a b*P 
and the other at 27876.52 cm™', suspected to 
b ?G,, level, it was possible to classify 18 mor 
of Rhi. One of these lines had previously been 
correctly classified This is seen in the following 
example by the much closer agreement between t 
observed and computed vacuum wave numbers g 
by the new 6?7G level 


Wave number 


Wave- 


j Classificatior 
length 


*Com 


Observed 
puted 


cn 22, y* Dix 


My 26 
12700 (former 
b? Gay — 2? Fxg (pre 


and in table 1 for the comy 


fo nae 
The new transitions in the infrared have result 


in the assignment of J-values to two known 
for which the values were previously ambi, 
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yllow] 


rer 


nm ¢ 


| 20y6 as at 51324.35 cm combines with 
ving J-values of 1% and 3'4, which, according 
pnsition rule of AJ=0 1, determines the 
o be 2% In like manner, the level 24 

54 em~ combines with levels of J/=0 

ind has therefore been assigned J=1 


yarison of the observed and computed wave 


-93 


for classified lines shows that the differences 

0.04 em-', with 30 percent of the lines 

to within 0.03 em These differences are 
error of measurement for long waves 
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Heat Generation in the Setting of Magnesium 
Oxychloride Cements 


Edwin S. Newman, John V. Gilfrich, and Lansing S. Wells 


l. Introduction 


esium oxychloride cement, also called mag 


Is used exte nsively for 
iormed DY the 


ment or Sorel cement 
laces It is 
ad magnesium solution on active 
im oxic Numerous filler materials, such 
hardwood sawdust, tale, and 
| silica, are incorporated to impart desired 
finished floor The 
materials together appears to 
oxvehloride MeO. MeCl] 11H.O 
instable in contact with 

s containing less than 11 percent of MgCl 
sequently the floor must be protected from 
tion of water The 
are such that 
nterstices of the 
oxychloride 
maductive to electricity 


action of cone- 


chloride 


tos fiber sand 


properties to the com- 
nding these 
magnesium 


nis compourn is 


conditions of mixing and 
MeCl, solution is retained 
hardened cement This 
and renders the 
a desirable property 
electricity may 


preserves the 


tions where sparks of stati 
hazard 
no information in the 
re on the amount of heat generated by mag- 
oxvchloride cement while hardening. Chas- 
2] measured the temperature rise during 
of MgeO-Me ‘1.-H,O mixtures and found that 
mperature rises of the pastes and their 7-day 
reased together. However. 
no values for the quantity of heat venerated. 
rk reported in this paper had for its primary 
the the amount of heat 
ted by several commercial flooring cements 
lition, similar information was obtained con- 
mixtures of MgO and MgCl, solution without 
rs ordinarily used 


appr ars to he littl oO! 


ssive strengths nat 


determination of 


2. Apparatus and Procedure 


heat-of-solution method [3] for determining 
it of hardening is not easily applicable to com- 
il oxychloride cements because of the large 
rtion and inhomogeneous nature of the filler 
als used in their preparation. Consequently, 


cate literature references at the end of this paper 


direct methods were l it termine the heat of 
hardening of the 
of Tests the 


In one 
heat generation was determined by 


commercial cements roup 


recording automatically the temperatures attaimed 
when cement samples enclosed in vac 
allowed to Detailed the 
the Ss been published 
Weighed samples of the pastes were placed in 


ium flasks were 
harce n 


descriptions ot 


apparatus used in tests have 
t, 5) 
y-pt Iriction-top cans provided with thermocouple 
Three-junction 
inserted, and the 


in widemouthed 1-pt va 


wells. copper-constantan thermo- 


piles were assemblies were placed 
flasks The flasks 
were corked and plac ed in an air bath maintained at 
30.0° €C The temperatures attained by the 
during the ensuing 18 hr recorded 
The vacuum flasks calibrated electrically 
Heaters were built and embedded in plastic in cans 
identical with used for the cement samples 
Th placed in the the 
power input was maintained at a constant level until 
the temperature within the cans became 
At this point the heat flow from each flask was equal 
to the power input to that flask Suitable 
resistances were arranged to enable the 


pastes 
were 
were 


thos 


assemblies were flasks, and 


constant 


auxiliary 
current and 
recorded on the 
Measurements 
besides zero) of power input 


across the heater coils to be 
chart as the 
made at two levels 
to the flasks 

The heat-transfer coefficients somewhat 
with temperature because of the large differences in 
temperature between the interior of the flasks and 
the surrounding au bath The total heat lost 
a cement sample was evaluated from the time-tem- 
perature curve, using the measured the 
heat-transfer coefficient The sensible heat remain- 
ing in the sample, usually a small quantity 
culated as the product of the final temperature dif- 
ference and the estimated heat capacity of the sample 
and containe! 


voltage 
same 
were 


temperatures 


varied 


from 
values of 


was cal- 


A second seriés of tests were made with a conduc- 
tion calorimeter similar to that described by Lerch 
6). This apparatus was also calibrated electrically. 
Because of the more rapid flow of heat from the 
specimen, the conduction calorimeter permitted the 
hardening of the cement to occur at lower tempera- 
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tures than did the vacuum flasks. The heat of Six samples of magnesia used for ma 
hardening was obtained from the calibration con- | oxychloride cement and for other purpos 
stant and the integration of the time-temperature | obtained from a producer. These samples 
difference curves In both these methods the inte- | differing activities In table 1 are show 
gration was extended over an 18-hr period details of their physical and chemical pri 
The heats of hardening of oxychloride pastes pre- Magnesium chloride of analytical reagent pu 
pared from MgO, MgCl,.6H,O, and water were de- | used to make solutions for use with these n 
termined by a modification of the standard method | samples 
of determining the heat of hydration of portland 
cement [3]. The heats of solution of MgO and of 4. Results and Discussion 
the hardened pastes were determined in the vacuum- 
flask calorimeter in 425 g of 2.00 NV HCl. The heats 4.1. Direct Measurements 
of dilution of MgCl, solutions were determined in the 
same quantity of acid to which 1.0 g of MgO had The compositions of the active portions of 
previously been added The MgO content of the commercial cements were calculated iro 
paste was determined by dissolving a sample inan | magnesia contents and directions for mixing 
excess of standard hydrochloric acid and titrating the | by the manufacturers. Figure 1 shows the e 
excess HC] with NaOH solution. Methyl red was | tions of the cement (half-shaded circles) in t 
used as an indicator In order to prevent the pre MeO. H.O. and MeClL.6H.O. The con 
cipitation of Mg(OH),, approximately 1 g of solid | 5\feQ.MeCl,.12H.O and 3MgO. MeCl..11H.O 
NH,CI was added to the solution prior to the titra- | as closed circles. are believed to form in mag 
tion The MeCl content was calculated from the oxvechloride cement The two comp 
chloride determined by titration with silver nitrate | 9\f{oQO.MeCl,.6H.O and 9MeO.MeCl..14H.0 [s 
solution, using a second portion of the paste dissolved also shown. The composition of the pastes 
in nitric acid solution [7 The MgO content of the | with the six samples of magnesia, 40MgO:40Met 
active magnesia used in making the pastes was also | §H,0:20H.O by weight. was selected for stud) 
determined by titration. The heats of solution were | consideration of Chassevent’s data [2] in the 
calculated on the basis of the MgO content The that such pastes would develop the maximul } 
heats of hardening, also expressed in calories per gram | of hardening. The composition is very nearly 
of MgO, were determined by subtracting the heats | of the compound 5MgO.MgCl.12H,0, the { 
of solution of the hardened pastes from the heat of | form in the hardening paste 2, 9] 
solution of the magnesia, corrected for the heat of The directly measured heats of hardening of 
dilution of the MgCl, solution contained in the flooring cements open circles) and of the Met 
paste MeCl,-H,O pastes closed circles) are show! 
3. Materials figure 2 and given in table 2. Excluding magnes 
sample 3, which had a very low surface area, 
Nine samples of commercial magnesium oxy- | heats of hardening of the pastes and of the floo 
chloride cement were obtained, representing the 
products of seven manufacturers These samples 
were chosen to represent the material made e¢om- 
mercially by the larger producers. Magnesium 
chloride solutions to be used with their product were 
supplied by the makers, either as solution or as 
flakes, MgCl,.6H,O, with directions for preparing 
solutions of the proper concentration 
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Loss at 
W ater 
Loss 0 
CO, 
Bulk density 
Surface 2 
Hydration in 24 hr Cl, 64,0 ———»—___» 
20 
Expansior . 
URE | Composition of magnesium oxychloride pa 
cements 
* (rained weight 
From loss at 450° ¢ . Weight-percent compositions of the pastes prepared for heat< 
By difference determinations; @, composition of the cementitious material in the co 
4 By nitrogen adsorptior flooring cements; @, composition of compounds that have been identif 
¢ By ignition loss of 2-¢ sampk xed with 2 and allowed tos | system MgO-MgClh-H;0. The numbers attached represent the mola 
f Mixed in the weight ratio 1:19 wit! tl i ceme al edt rden for tions of MgO, MgCls, and HzO that have been assigned to the cor 
24 hr, and autoclaved 3 hr at b/ir Letters near identify the samples for easy reference in the text and fig 
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were of the same order. Both sets of samples 
a wider range of 18-hr heat of hardening in 
um flasks than in the conduction calorimeter. 
a samples 1, 5, and 6 all had about the same 
area (table 1 5 and 6 were 
o be magnesias used for oxychloride cement. 
ree each developed 180 cal/g of MgO in the 
flask compared average of 193 

MeO for the flooring cements, and an 
of 2083 eal/g of MgO in the conduction 
ter compared with an average of 177 cal/g 
) for the cements. There is some indication, 
ed lines in figure 2, that the heat generation 
er with smaller percentages of MgO in the 
tious material However, as the cements 
roducts of different manufacturers, the 
as used in their preparation were probably 
t. and, as the closed circles of figure 2 show, 
dental variation of the heat of hardening 
different samples of oxychloride magnesia 
ecount for this apparent correlation. The 
ents showing the higher heats of hardening, 


and samples 


with an 
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MgO IN CEMENTITIOUS MATERIAL , PERCENT 


magnesium oxrychlor de 


2. Heats of 


pastes and cements b lirect measurement 
i / ‘ 


hardening of 


ta 1 in the conduction ealorim- 

m flask , Values obtained for 

ed with MgO-MgCl:-H:0 mixtures 

lable magnesias rhe cements 

en by the manufacturers; the 

pared at the same composition, 40 percent of 
0 percent HsO , Change with in- 


g of the various cements, calculated 


Presumably similar 
The four 
points representing these two cements are in excellent 
agreement with the dotted lines of figure 2. This 
agreement 1s to be exper ted because these are points 
far enough removed from the rest to be important 
Taken by them 


figure 2, are of the same brand 


magnesias were used in their preparation 


in giving direction to the curves 
the two cements show a 
taken as additional 


selves, however similar 
trend This agreement may be 
evidence for the correlation indicated by the dotted 
The variation of heat of hardening with MgO 
demonstrated more fully in the 
heat-of-solution measure- 


lines 
content will be 
con erned 


section with 


ments 


should be pointed out that the conditions of 
heat flow in the tests made with the vacuum flask 
approximate the conditions in mass concrete. There 
has been considerable interest in massive structures 
made from magnesium oxychloride cement. The 
conduction calorimeter simulates approximately the 
conditions in a thin slab of concrete where heat flows 
to the surroundings and excessively high 
likely to occur In figure 3 
time-temperature curves of 
hardening in a 
commercial 


readily 
temperatures are not 
are shown typical 
magnesium oxychloride 
vacuum flask. Curve B 
flooring cement and curve A the magnesium oxide 
with which the cement was prepared. Magnesia 
sample number 5, tables | and 2, and cement sample 
table 2, are represented by these curves 


cement 
represents a 


number | 
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Similar curves obtained with the conduction calorim- 
eter are shown in figure 4. The maximum tempera- 
ture was reached from 3 to 5 hr later in the conduc- 
tion calorimeter. This delay that the 
much higher temperatures reached in the vacuum 
flask accelerate the hardening of the materials 

The compressive strength of magnesite cement is 
reduced by high temperatures during hardening, 
such as would occur in the interior of mass concrete 
In figure 5 are shown the relative compressive 
strengths of 2- by 4-in. cylinders that were allowed 
to harden for 24 hr enclosed in vacuum flasks. The 
cylinders were then removed and stored at 25° C in 
the laboratory. Companion specimens were allowed 
to harden without protection from loss of heat and 
were similarly The 7-day compressive 
strengths of the protected specimens, calculated as 
the percentages of the strengths attained by the 
unprotected plotted against the 
maximum temperatures reached by the protected 
specimens. The cements are those listed in table 2, 
except for sample 4 Reduction in strength occurred 
for seven out of the eight cements and amounted in 
65 percent The dotted line 


indicates 


stored 


specimens, are 


one instance to 


Fy 


f 


was calculated by the method of least squares 
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URE 5 


lhe strengths of 2- by 4-in. cylinders allowed to harden in a vacuum fils 
uted as the percentages of the strength of companion cylinders wi 
igainst loss of heat, are plotted against the maximum temperature atta 
rage conditions: Protected cylinders, 24 hours 
and 50-percent relative humidity; unprotecte 
The position of the 
Point A, representing 
was omitte t 


vacuum flask. St 
lask, ¢ 


days at 25° C 


lays at 25° C d 


and 50-percent relative humidity 


rom which cement B (sample |, table 2) was made 1 from 


Effect of hardening temperature on the com} 


> : , ; 
strength of magnesium oxychloride flooring cemer 


MOm™m 


YNAMK 





epresents the equation calculated from the 


the method of least Point A. 
s not included in the calculation, represents 
sample 5, and po B represents flooring 

These are the samples represented by 
B of figures 3 and 4. The cylinders 
A disintegrated in 5 days: 

I therefore was 


squares 


and 
d by 
\ compressive 


point 
stre ngth 
s zero 
mic method of determining the dynamic 
ol elasticity / was used to follow the 
of the hardening of 1- by 1- by 11%-in. bars 
paste prepared with magnesia sample 5 
Curve C, figure } shows part of the results 
such series of measurements The bar 
suffic iently in 2 hr to be removed from the 
measurements begun at once. The 
of elasticity at 2.2 hi 260,000 Ib/in 
t had increased to 920,000 Ib/in and at 
240,000 Ib/in.’, three-fourths of the 
vr. The most rapid increase in / occurred 
period of rapid of heat evolution 
by curve A of figure 4, although sufficient 
had developed to enable the removal of the 
from the mold about hr before the rapid 
The early and rapid 
modulus that is a substantial 
is characteristic of mag- 
widely differing 


Was 
about 


ncrease 


became evident 
ent of a 
of the final value, 
oxychloride pastes of 
tions. 
ficient data are available in this work with 
mmercial cements and magnesias to 
the effects of temperature of hardening 
variations of the properties of the magnesia 
k does show, however, that the heat of hard- 
f ordinary magnesium oxychloride pastes and 
sis about 190 eal/¢ of MeO (about 50 calories 
am of cement) at 18 hr, and that temperatures 


active 


the boiling point of water may be reached in the 


of a mass of magnesite cement. This quan- 
heat is perhaps five or six times as much as 
be developed in the same period in an equal 
f portland cement concrete. Because the 
ride cement is usually less dense than con- 
om three to four times as much heat would 
eloped in a given volume of oxychloride 
as in the same volume of concrete. 


4.2. Heat-of-Solution Measurements 


Lematic study was made of the heats of solu- 
mixtures of magnesia and magnesium chlo- 
itions. Magnesium oxide sample 6, table 1, 
| with solutions of MgCl, of analytical reagent 
The compositions studied are shown as 
open circles in figure 1. These points rep- 
he average compositions calculated from the 
; of the paste samples made when the heats 
were determined For convenience in 

samples A to E are considered as made 
solution containing 35 percent of MgCl; 
F to K, with a 27-percent solution; L to P, 
l-percent solution (22° Baume); and samples 
with a 15-percent solution. The last three 


lon 


groups of samples were prepared by adding MgO 
to stock solutions of approximately these concentra- 
tions, and deviations of the points representing these 
mixtures from straight lines through the 
MgO vertex of the diagram (fig. 1) can be due to 
analytical error only. The first group of samples 
was prepared by mixing MgO with the supernatant 
l-lb bottle of 
this 


passing 


liquid formed by adding water to a 
MeCl,.6H,O crystals The 
solution undoubtedly varied from time to time 
The heats of solution of the pastes having compo- 
indicated by the points in figure 1 were 
determined at various times after mixing. The heat 
of solution of the MgO was also measured. The heats 
of dilution of MeCl solutions in 2.00 
V HC] added MeQ per 425-¢ 
charge of determined Duplicate 
determinations 10-ml 
data 


concentration of 


sitions 


the various 
containing | ¢ ofl 
acid were also 
ising 2 - and 
quantities of each of the four These 


were plotted and inte rpolated values of the heats of 


were made 


solutions 


dilution appropriate to each heat-of-solution exper! 
ment were obtaimed The heat of the 
hardened paste was subtracted from the sum of the 
heat of solution of the MgO and the heat of dilution 
of the magnesium chloride solution. The resulting 
quantity is the heat of hardening of the 
the values so obtained are plotted in figure 6 


solution ol 


paste, and 

In this figure the pastes in a given column were 
prepared with solution of the approximate concen- 
indicated at the top of the column. Thi 
correspond to the lines drawn to 
the MgO vertex in figure 1. All the samples in a 
given row contained approximately the same pet 
centage of MgO. The rows of figure 6, therefore, 
correspond approximately to lines drawn in figure |] 
parallel to the base and at 10-percent intervals 

The heats of solution are subject to analytical 
errors from two The first is the inherent 
error in the determination of the MgQO in the paste 
Judging from the differ- 


tration 
columns, therefore, 


sources 


sample used for analysis 
ences between the results for duplicate samples for 
the same paste, the error in the determination of 
MgO was small in instances. T 
source of error was the difficults in obtaining 
transferring certain samples. Paste containing 
than 30 percent of MgO were of such consistency 
that many could not be weighed accurately and 
quickly and introduced rapidly into the calorimetet 
Consequently, these pastes were exposed to the 
atmosphere longer than No entirely 
satisfactory technique was for handling 
pastes of soapy or greasy consistency. As a result, 
there was some uncertainty in the weight of the 
calorimeter sample and in the correspondence of the 
calorimeter and analytical samples 


most he second 
and 


less 


desirable 
dey ised 


It will be seen from figure 6 that the greatest part 
of the heat of hardening was developed in a short 
time. Figure 6, N, shows a sample, 30-percent of 
MgQO-21-percent of MeCl, solution, that at 1 day 
had developed nearly SO percent of the 60-day heat 
of hardening. Sample O developed nearly 95 pet 
cent of the 60-day heat of hardening in 2 days \ 
sample of approximately the same composition as 
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ompositions of tt mpl letermined by ys ¢ indicated by the corresponding letters in figure 1 [These pastes were prepared 


Ficure 7 Heats of hardening of magnesium orychlor 





The €0-day heats of hardening, taken from figure 6, are plotted aga 
entage of MgO in the paste. The percentage figures shown op posit é 
ning of each curve give the nominal concentrations of the MgCl: s 
which the pastes were prepared, 


60-DAY HEAT OF HARDENING, Col /g MgO 








«0 
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developed 240 eal/g of MgO in 4 yr The 
60-day heat of hardening of the samples 
ed in figure 6 is 225 cal/g of MgO. These 
d the curves in figure 6 indicate that after 
few days heat generation in magnesium 
de cements is very Attention is 
» the curves representing the behavior of 
\, B, R, and S The values obtained for 
of solution of these samples indicate that 
of hardening first and then de- 
ith the elapse of time The authors can 


slow 


Ln reased 


t offer no convincing explanation of this 
d behavior, if the apparent trend is real 
three of these samples were 1n the composi 


where it was difficult to transfer the paste 

vely because of their consistency, errors in 

from this cause are not believed to be large 

o cause deviation from the expected trend 
ienitude shown here 

-dav heats of hardening from figure 6 are 

figure 7, plotted against the percentage of 

Here is definite evidence of a 

the heat of hardening 

of the MeO content Three of the curves 

7 show maxima in the neighborhood of 20 

of MeO. These curves show that the 60-day 

hardening of mixtures containing large 

; of MgO is less than the heat of hydration 

MeO to Meg(OH),, 224 ecal/g of MgO It has 

wn, {9} that unreacted magnesia may re- 

magnesium oxychloride pastes after harden- 

he pastes studied here were in sealed con- 

so that no hydration subsequent to harden- 

| be caused by moisture from the atmosphere 


the pastes 
of those pastes as a 


as apparently occurred in the work cited. X-ray 
patterns of certain of these pastes also indicated that 
MgO was present. The presence of unreacted mag- 
hesia in the pastes made with the more concentrated 
magnesium chloride solutions could account for the 
low heats of hardening found with the larger amounts 


of MeO 
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l. Introduction 


s methods and devices have been 
the measurement of the stiffness of 
extiles, and highly flexible materials, and 
has been defined in various ways. Most 
ffness testers now available measure stiffness 
bitrary and relative manner, evaluating such 
es as the force required to bend a strip of 
hrough a certain angel to deflect the end 
determine the length on 
act to produce a given 


man 


ad tor 


or 
amount some 
viven must 
Stiffness of paper Is therefore usually defined 
moment, angle, or length 
| produce a given effect specimen of 
mensions strained in a specified manner. A 


force 


some fore e 


as 


on a 


ol various types of stiffness testers especially 
for paper testing is contained in an article 
rk [1] 
sometimes defined according to the 
m the strained member is expected to perform. 
neering mechanics the stiffness of a beam is 
ed by the load that it can carry with a given 
In this sense the stiffness depends on 


is 


nm 
beam is supported and how it is loaded. 
other hand, various flexural properties may 
fically defined so as to be more or less inde- 
of dimensions and conditions imposed on 
essed member 
field of textiles Peirce 
etail the problems and theoretical difficulties 
ilating the stiffness and related properties of 
He measured the stiffness of textile fabrics 
ending specimens as cantilevers bending under 
wn weight and expressed the results as (a) an 
bending length equal to the length of the 
n multiplied by a somewhat complex function 
a quantity called flerural 
defined the bending moment for unit 
re per unit width of the material, and (ce 
) modulus, or intrinsic stiffness of the material, 
tional to “flexural rigidity’’ and inversely 
tional to the cube of the thickness 
fer [3], with his Flexometer, evaluated flexural 
property closely related to stiffness, and also 
By means 


2] has discussed in 


bending angle, (b 


as 


exural properties of textile fabrics. 
rated springs, a measure was obtained of the 
necessary to fold the specimen back on itself 


aS the ier 


until a certain minimum angle of fold (angle between 
the tangents al the two ends of the specimen 
attained, the minimum angle being proportional to 
the thickness of the material tested. The flexural 
work was then evaluated from the torque and angle 
Measurements were also made on paper. 
in a discussion 


was 


readings 

The Institute of Paper Chemistry 
preliminary to the study of certain commercial paper 
stiffness testers, defined some flexural properties 
related to stiffness [4] Flexural rigidity de- 
fined as the bending moment required Lo produce 
and rigidity was defined as 


was 


unit curvature of bend 
the flexural rigidity per unit width, or the bending 
moment per unit width required to produce unit 
curvature of bend 

Clark [1], in formulating for 
obtained with his apparatus, in which the specimen 
is disposed cantilever bending under its own 
weight through large angles, defined stiffness as the 
cube of the critical length multiplied by an arbitrary 
constant. Stiffness appears be proportional to 
the modulus of elasticity and the moment of inertia 
and inversely proportional to the weight per unit 


expressions data 


as a 


to 


area [4] 

Sharman [5] measured the stiffness of paper with 
a pendulum damped by a flexing paper ring, and 
defined stiffness the bending moment per unit 
width that produces unit curvature. He pointed 
out, however, that the modulus of elasticity has 
different values in various directions in the sheet of 
paper, and that it is necessary to evaluate a stiff- 
for and a stiffness for 
cross direction 

Abbott [6] stiffness of fabrics, 
had been rated subjectively by experts, by 
of several methods and found that measurements 
made with an apparatus similar to that of Peirce [2] 
as flexural rigidity, agreed best with 


as 


ness machine direction 


which 
means 


measured the 


and expressed 
the subjective ranking 

‘flexometer”’ 
cantilevet! 


specimen by and the 
reaction force was measured by means of a sensitive 
Stiffness was expressed as 
calculated by 


Hebeler and coworkers [7] devised a 
with which a torque was applied to a 


means of a rotating clamp 


electronic vage 
bending modulus 


the elastic modulus 
means of the conventional! equation for a cantilever 


beam loaded at the end 


strain 
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Ficure |! Schematic drawing to illustrate the princi ple of the 


. tester 
stiffness teste 


2. Testing Instrument 


The principle of the apparatus described herein 
was outlined some years ago by the authors [8] in 
connection with an attempt to use stiffness as a 
measure of artificial wear of currency paper as a 
result of repeated crumpling. A test was required 
that would reflect the continuous deterioration of 
the paper resulting from the crumpling treatment 
However, tests made with a crude, but adequately 
sensitive, model showed that the stiffness did not 
decrease continuously, but actually increased during 
the early crumpling treatment, because of the cor- 
rugating effect that increased the effective thickness 
and the moment of inertia in bending. This idea 
for the evaluation of the stiffness of paper, not 
being applicable to the problem, was laid aside for 


a time, but more recently was reviewed and further | 


developed 


Figure 1 is a schematic drawing intended to illus- | 


trate the principle of operation, figure 2 shows the 
finished apparatus, and figure 3 shows the bent speci- 
men in relation to the clamps. A clamp, C, is 
suspended between two lengths of piano wire, W, 


Ficure 3. Diagram showing the bent specimen held 
end in a rotatable clamp, the torque and reaction ares 


the bending angle, @ 


(about { mm in diameter), the outer ends of 
wires being fixed at points A and B to a pivot 


frame, F. The wires are fastened to the clamp 
line with the clamping edge. 


t} 
i 


One end of the paper 


specimen, S, is fastened in the torque clamp, ‘ 
whereas the other end is held by a similar clamp, k 
In operation, a bending torque is transmitted throug 


the clamp, C, while clamp, K, applies the reacti 
The latter clamp is also mounted with its 


; 


axis 


through the front edge. This axis corresponds | 


axes, so as to allow the specimen to bend free! 
naturally, being restrained only from displace! 


about the axis WW. 


The pivots of clamp, / 





the free end of a cantilever and must be perfect 
free to turn, or to move in the plane of the 


eely in slots in the supporting bracket; link 
rts the clamp 
torque is applied by rotating frame F, to 
ie outer ends of the piano wires are attached. 
jue, transmitted through clamp C, bends the 
rough an angle 6 (fig. 3), shown by the posi- 
the pointer P, which is integral with clamp C, 
fixed scale, D (fig. 1 Simultaneously, the 
s measured by the position of the pointer 
torque scale, T, attached to the pivoted 
F, that is, by the angular displacement of 
with respect to the torque frame, F 
pparatus Owes its sensitivity in considerable 
to the design that eliminates the need of 
in the torque measuring device. Further- 
ecause the bending axis is vertical, the meas- 
is uncomplicated by a gravity component 
crease the range and usefulness of the instru- 
t, a means is provided for varving the length of 
specimen, that is, of changing the distance 
veen the two clamps The bracket that supports 
ction clamp, K, moved along the 
tal scale to the rear (fig and clamped in 
position. This adjustment allows the effective 
eth to be varied from 1% to 12 em. This detail 
t shown in figure 1. The width of the specimen 
iso be varied. The maximum width that can 
mmodated is about 7 em. Another obvious 
ns of broadening the range of the instrument is 
ve the diameter and length of the piano wires 
{nother detail not shown in figure 1 is a means of 
¢ the zero adjustment. Paper is seldom per- 
flat, but usually has a little curl. As a result 
inter will usually not register zero on both 
s after the specimen has been clamped in place 
hould set the zero of the torque scale under the 
then move the fixed scale, D, (by means of 
er at the right in fig. 2) until the two zeros 
pointer coincide 


can be 
») 


ake the test, the torque is applied by rotating 
rame, F, and the torque scale, T, attached to it 
aper is bent, first to the right and then to the 
a selected angle of deflection shown on the 
ale, D, and in each case the corresponding 
on the torque scale is read. The average of 
o torque readings is taken as the angular 
of the torque corresponding to the selected 

ng angle, or angle of deflection. 


This operation should be done unhurriedly, yet 


voter 


“2 thout needless delay. The readings obtained are 
imp u . 


e sensitive, but the paper will ordinarily not show 
preciable plastic deformation until it has been held 
several seconds in the bent state. A further 
tion against error due to plastic strain is to 
two readings after bending the paper in opposite 
ms, and average the two. The average of the 
orque-angle readings is converted to a bending 
t, in gram-centimeters, by means of a cali- 
n constant, which is determined in the following 


instrument is rotated around the horizontal 
of the torque scale through 90 degrees until 
res are horizontal. The pointer is_ then 
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counterbalanced with paper held in clamp C until it 
returns to zero. A small weight is fastened on the 
pointer at a known distance from the piano-wire 
axis. This, of course, causes the pointer to move 
downward. The instrument is then rotated around 
the piano-wire axis until the pointer is again hori- 
zontal. The moment increment (added weight in 
grams times lever arm in centimeters) divided by the 
angle read opposite the pointer on the torque scale 
gives the calibration constant in gram-centimeters 
per degree. This procedure is repeated with different 
added weights and lever arms, and on both sides of 
zero over an interval on the scale representing the 
range of readings usually encountered. The average 
value of the calibration constant found in this way 
was 0.092 g-cm/deg 

The torque angle, read on scale T, multiplied by 
0.092 gives the bending moment in gram-centimeters 
for a specimen of a chosen width and length (distance 
in centimeters between clamps) and for the angle 6 
read on scale D) through which the specimen is bent 
The bending angle, @, is the angle between the chord 
through the two axes and the tangent at the torque 


> 


axis (fig. 3 
3. Definition of Stiffness of Paper 


In the study here reported the stiffness of paper 
has been defined as the bending moment per unit 
width of the specimen and per unit curvature of the 
specimen at the torque axis. Stiffness thus defined 
is the same as the “flexural rigidity’ of Peirce and 
Abbott, the “rigidity” of the Institute of Paper 
Chemistry, and the ‘stiffness’ of Sharman 

The following equation expresses in symbols the 
stiffness of paper as defined above, and shows how it 
is theoretically related to the other quantities in the 
equation for the elastic deflection of a cantilever 
beam loaded at the free end 


M VR EI Ed 
kK b h h 12 
since MR=EI 


Mi? ML 
SbF 3h 


VL’ /3F for the case assumed when 
6 is small, /=4d°/12 for a rectangular cross section, 
and, when @ is small, /(@)=L/F=1/sin @=1/tan 
6=1/@ (@ in radians) 

In eq (1) S is the stiffness, M is the bending 
moment at the torque axis, 6 is the width of the 
specimen, A=1/R is the curvature, FR is the radius 
of curvature at the torque axis, F is the elastic 
modulus (bending modulus), J is the moment of 
inertia, d is the thickness of the specimen, L is the 
span or bending length (distance between the axes 
of the two clamps), F is the deflection of the free 
(loaded) end (corresponding in fig. 3 to the distance 
from the chord to the end of the tangent that repre- 
sents the length of the unbent specimen), and @ is 
the bending angle. 

The width, 6, and the bending length, LZ, of the 
specimen can be chosen at will within the limits of 
the apparatus, and the bending moment, M, is 
obtained from the reading on scale 7 (and the 
calibration constant) for a chosen value of angle 6 
on scale D 





There are three principal reasons for choosing the 
above definition for the stiffness of paper, all of which 
are linked with the equation for the elastic deflec- 
tion of cantilever beams, inasmuch as the specimen 
bent by means of the apparatus pictured in figure 2 
appears to behave as a cantilever loaded at the free 
end, when the ‘‘wall’’ is rotated while the “‘free’’ end 
is loaded by the reaction of the “fixed’’ clamp, K 
These reasons are as follows 

] It is desirable to express stiffness in such away 
that it will be independent of the width and length 
of the specimen and the bending angle The canti- 
lever equation suggests that this relation should hold 
within certain limits) for stiffness as defined above 

2. The stiffness of paper should be so defined as to 
recognize that the fixed thickness is an inherent 
factor in the stiffness \ paper, once it has been 
fabricated, has a fixed thickness that is as much a 
characteristic of it as its composition or structure 


of angles used in the study reported in th 
Somewhat similar to it, f.(@) ean be der 
setting the coordinates z and y for large d 
equal to LZ cos @ and L sin @, respectively, and 
these values in place of x and y in the sol 
the differential equation for the radius of « 
of the elastic curve 
In this investigation the bending angk 
stricted to the range 5° to 30 Table 2 shows 
a sulfite bond paper, the stiffness, S,, S 
and S,, corresponding to the various expressior 
f(@) in table 1, calculated by eq (1 S, was on 
because it differs so little from S Each sps 
was bent successively through the various angies 
that variation in the material does not affect 
relative stiffness calculated for the various 
However, to minimize accidental errors of obs 
15 specimens were used, an 
k for each angle is sh 


Equation (1) suggests that the stiffness of paper as 
defined is a function of the elastic modulus and the 
thickness 

3. Stiffness as defined can be simply expressed in 
terms of the measurable quantities vielded by the 
apparatus shown in figure 2, and given in the last 
expression in eq (1 

The remainder of this paper is devoted primarily to 
examining the validity of the proposition that the 
stiffness of a given specimen of paper, as defined 
above, remains constant as the bending length, 
width, and bending angle are varied, with some 
consideration of modifications necessary to give a 
constant value where the simple relation fails 


‘ ' : : tion, from 5 to 
4. Stiffness With Variable Bending Angle an ain ot Mane 


in the table. 

Figure 4 shows graphically the stiffness (S 
each length as a function of the bending a 
The graphs for S, to S, are similar to figure 4, ex 


In the last expression in eq (1), which has been 
chosen to evaluate the stiffness of paper, we should 
expect the torque or bending moment to be pro- 
portional to the angle @ (or its sine or tangent). only 
when @ is small. It is not feasible to measure : 

seiieeeadiihen Fe: ; ; - : a TABLE 2. Stiffness values and average deviations 
accurately very small angles with the apparatus bend geeer, machine direction, ealevieled bs 
described. Furthermore, paper is frequently bent equation (1) and the functions of the bending angli 
through rather large angles in its many uses, and é 
it is desirable to know something about how the 
stiffness is affected at these large bending angles 
Others have realized the difficulty involved in using 
the simple expression to evaluate data obtained at 
large bending angles, and have attempted to modify 
the cantilever equation, largely by empirical means, 
to make it more useful for the larger bending angles 
At least six expressions for f(@) have been suggested. 

These are given in table 1, as well as their values for 
angles from 5° to 50 

The expression for /(@) heading the last column in 
table 1 is due to Peirce [2], and the simpler forms, of 
course, follow from the assumption that @ is small. 

The expression for f,(@) was taken from notes on a 
Danish manuscript that came to the authors’ atten- | 
tion some years ago, which apparently was not 
published. It differs little from /,(@) for the range 


If one wishes to know the intrinsic stiffness of paper as a material, a theoretical 
value can be obtained by dividing by e cube of the thickness, giving the bending 
modulus of Peirce [2] and Hebeler, et al 
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Siiffness values and ave age demations for a su 
pe machine a clion a ilated by means 
1) and the functi s of the bending angle, 6, shou 
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In table 2 the minimum deviation most often occurs 


when stiffness is calculated as S,, corresponding to 


f(@)=1/tan 6. The next most frequent occurrence 
of the minimum is under S; and S 

here is observed a tendency for the minimum 
variation to shift toward the right-hand columns of 
the table as the length of the specimen increases 
It may be suspected that this shift reflects the fact 
that a smaller range of bending angles was used in 
testing the longer lengths (see fig. 4); and this 
circumstance is recognized as a weakness in the us¢ 
of the minimum variation as the criterion of the 
most suitable form of S. The curtailed range of 
bending angles results, of course, from the poor 
precision in reading small bending moments when 
long specimens are bent through small angles. This 
shift seems logical, however, from the fact that the 
slope of the curves (which increases with length as 
shown in fig. 4) tends to decrease as the subscript 
of S increases 

Justification for the use of the minimum variation 
as a measure of suitability of the different forms of 
S was further investigated by means of another set 
of tests of the same paper at bending angles of 10 
20°, and 30° for all lengths. In these tests, in which 
all lengths were tested in the same range of bending 
angles, the results were much the same as before 
The same type of shift was shown, and the optimum 
expressions for f(@) were indicated to be f,(@) and 


is we proceed from S, to S, the slope of each 
tends to decrease, becoming negative for the 
s ibsecripts of S 
observed from table 2 that in general S, 
es with increasing 6, whereas S, decreases 
here in between there is a minimum variation Somewhat similar data for several types of pape 
stiffness values with bending angle. As a| are shown in table 3, the variation with bending 
of this variation, the average percentage | angle alone being given. The value for S, was 
on from the mean stiffness value in each | calculated only when the minimum approached it 
s given. The least value of the deviation in | closely. Each specimen was always bent through 
ngth group, indicated by an “a”, seems | the three or more angles (usually 10°, 20°, and 30 
re to indicate the corresponding form of f(@) | The tendency for the minimum variation to shift 
elds the most nearly constant stiffness value. | toward the larger subscripts of S as the length 
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f,(@) about equally often 





increases is shown for all papers tested. Table 4 
summarizes the relative distribution of the minima 


The evidence seems to point to f,(@) and /,(@), the 


reciprocal of the angle or of its tangent, as the most 
desirable forms, particularly 
Spec imens are avoided as mu¢ h as possible 


if the extremely long 


(s a matter of interest, the stiffness average (S 
for the bending angles is given in the next to the 
last column of table 3 for each length of specimen 
This affords an idea of the range of stiffness of some 
of the ordinary types of paper. The last column 
shows some values of Young’s modulus, /, obtained 

corresponding to the S values 
somewhat surprising that the small-angle 
f(@) yield stiffness values so nearly 


by eq (1 

It is 
expressions for 
constant when the angle alone is varied through such 


Data showing é osition ti eference to {(@) of 
mum variation o fIness as a function of the he nding 


the range 


tore 


Sulfite bond 
neat 


lirec 


hewn 


A ll-rag ledger,ma 
chine directior 
d =().0127 cm 


Kraft wrapping 
machine direc 
tion d=0.0140 
m 

Kraft wrapping 
cross direction 

Rope manila, ma 
hine direction 


Rope manila 
croas direction 
d=0.0178 em 


*A minimum variation in stiffness as a function of the bending angle in the 
range 5° to 30 


a large range. Hebeler and coworkers [7] rey 
somewhat similar finding for angles as great 
but no data were given. It is equally surpris 4 
the formulas do better for short specimens than { 
long ones, since the latter have smaller curvature { 


a given @ 


5. Stiffmess with Variable Width of Specime 


According to eq (1), for a given length, / 
specimen bent through a given angle, @, the ben 
moment, ./, should be proportional to the widt! 
of the specimen, in order to yield a constant val 
stiffness as the width is varied. In table 5 
group of three values for a given specimen, ha 
a common length and bent through the same ang 
shows how nearly constant the stiffness value rem 
when the width alone is varied. The missing va 
represent tests that could not be made withii 
limits of the tester. Wide, short specimens 
through large angles required torques too great 
the torque scale, and narrow strips 
through small angles produced torques too smal] 
be read accurately. The agreement is very gi 
within each group 


long, 


TABLE 5 a ffec ith of specimen on stiffness 


ed as ML/3h@ (su fite bond 
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ess With Variable Length of Specimen 


that, for a 
bending angle, the bending 


al be expected from eq civen 
moment should 
\ proportional to the length of the speci 
ler to vield a constant value of the stiffness 


rth of th 


? . 
speciinen 5 Varied ruil Wwe 


dv seen in tables 2, 3. an 5, that for a 
ng angle the stiffness 
th of specimen+=tnereased 
that the 


powell ol 


nereases some what 


It was found 
approximate ly 


specimen If 


torque Va ad 
the leneth of the 
we should multipiv the 

L we should 
alues as the 
this for the 
“he exponent ot / 
with different 


tervals, and different forms of /(@), ranging 


stiffness values in 
The arly 
Table 6 


values for 


obtain more 

length is increased 
average each 

howevel will vars 


papers different angle and 


tween 0.7 and 0.9 


7. Conclusion 


instrument designed to apply n 
variable 


SCNSILiVe 


men of paper of 


torque to a spec 


ns. stiffness measurements were made of 
varied from 


and the 


nds of paper the length being 
cm the 
n 5 to 30 
xpression & VML/36 0 
r the elastic deflection of cantilever 
stiffness (I th 
ested for G the 
t! 


width. { 2 to 6 em 


based on the 
beams 

' } 
oO evaluate several 


that 
lv found most itable for 


have been 


expressing 
1/6 and l/tan @ With 


paper seem to 
s tor f(@ the stiffness form ila lis 


reent tor a given 


angle 


Vil s constal 1? 7 pr 
width and be nding 


specimen | 


provi 1 the data are restricted to the 


his However when the specimen 


iffness values increase 


with 
ction 


al 
stuctoryv corre 


he followme modth 


in which Is ihn the n hborhnood of OLS 


The simple approximate expression 


V1 


appears to be adequate r tl aluation of the 


stiffness of mos papers 


to Norman HH 


Bureau during 


The 
Ditrick 
the summer of 1951 for 
} 


ol 


il ithors eCXPress thre ir gratituade 


student guest worker at thi 
valuable assistance in 


ytaiing data and making cale ilations 
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A Convenient Small Osmometer 
G. A. Hanks and S. G. Weissberg 


\ small, rugged osmometer that is retatively easy to assemble and us 
Typical data obtained with polystyrene solutions are presented Performance 
son with other osmometers is discussed 


1. Introduction metal or glass bridge. A horizontal mark etched on 
the capillaries serves as a reference tor capillary 
cription is given of a simple, rugged instru- | calibration. The male 10/18 standard taper at- 
r determining the osmotic pressure developed | tached to the measuring capillary fits into the metal 
ition of ahigh polymer. Othersmall osmom- | cell, which has been reamed and ground to accom- 
scribed previously [1, 2,3, 4, 5]," have been, | modate it. A mercury well assures a leakproof seal 
respects, too inconvenient to use. Although | The flat surface of the cell adjacent to the membrane 
ly easy to make, glass osmometers are fragile | is lathe-finished and lapped on an optical lap to a 
nay require frequent repairs. Small metal | mirror finish. The perforated plate adjacent to the 
neters, in addition to being rugged, can be | other side of the membrane is lathe-finished only 
ed for easy and rapid assembly and simplicity | The screw plug has three holes at 120° intervals at 
ation. The instrument described here is a | the top. A wrench with three \-in. pins, which 
cation of the apparatus described by Sands | engage these holes, is used for final tightening of the 
ohnson [4]. The new design permits the in- | assembly (fig. 2). For assembly purposes the 
ent to be assembled, rinsed, and filled con- | osmometer base fits into a solvent well (fig. 2), in 
tly, while keeping the membrane wet with | the floor of which are vertical dowel pins at 120 
at all times. The total volume of the solu- | intervals. These pins engage holes in the undet 
| is less than 2 ml so that a relatively small | side of C. The base plate of the solvent well is 
ty of polymer suffices for a complete osmotic | attached to a channel iron that can be clamped Lo 
‘curve. a work bench (fig. 2). A three-pronged brass hook 
Zimm and Myerson osmometer [5] has been | (fig. 2) facilitates transferring the osmometer to and 
the Bureau for some time. The outstanding | from the tubes that contain the solvent during the 
antages of this instrument are the ease of adjust- test The hooks fit into the holes in D 
the solution meniscus and its relative speed in 
ng equilibrium compared with other small 3. Assembly 
meters. On the other hand, its assembly is 
is and time consuming. Particularly in rinsing The assembly of this instrument is simple and 
ell, it is difficult to empty the osmometer with- | rapid. The osmometer base is placed with the holes 
risk of drying the membrane. After the instru- | engaging the dowel pins in the solvent well. The 
s finally filled with polymer solution, air | perforated plate is fitted into the osmometer base 
les must be removed from each osmometer. | Solvent is then poured into the well up to about 
ause considerable clamping pressure on the mem- | 0.5 em above the level of the perforated plate 
which serves as its own gasket) is required to | This assures that the membrane, which is put in 
an adequate seal, there is danger that the | place next, stays wet with solvent during the as- 
edges may be chipped. This is especially | sembly period. The cell is next placed on the mem- 
f the plane of the membrane is not exactly | brane. Finally, the screw plug is screwed in hand 
ndicular to the axis of the cylinder forming | tight. The wrench is fitted into the holes on the 
of the Zimm osmomete! screw plug, and a steel disk (fig. 2) is placed on top 
of the wrench. A C-clamp is used to clamp the 
assembly to the channel iron. This prevents the 
osmometer from riding upon the pins in the well 
during tightening. Final tightening is accomplished 
by using a 30-in. extension to the wrench handl 


2. The Osmometer 


smometer design is shown in figure 1. The 
portion consists of an internally threaded 
ember, a nickel-plated brass cell, an externally 
ed brass member, and a perforated brass plate 
glass portion of the instrument consists of a 
ng and a reference capillary, both cut from i. 
length of tubing of 0.5-mm bore. A male seal. There is no evidence of shearing of the mem- 
( c y yy ~- ° 4 2 ry 
L ta er (10/18) joint is sealed to one end of brane during the tightening operation The clamp 
x. TP J “hy af . lerw is removed, and the osmometer is now ready to be 
suring capillary. The reference capillary is | ¢ i 
| to the measuring capillary either by gq | filled with polymer solution. The capillaries should 
L te 7 eas gy Ci i 7 7 f : 
: ; in the meantime, have been cleaned with cleaning 


The membrane, which serves as its own gasket, is 
thus squeezed between the perforated plate and the 
annular surface of the cell. A torque of approxi- 
mately S800 in.-lb is sufficient to assure a leakproof 


rackets indicate the literatu references at the end th paper i solution 
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ly seating the capillary 


nen 
moved from the mercury well 

One must consider temperature effeets i 
adjustment of the solution meniscus 
measurements, meniscus adjustment is 1 
The osmometer ts plas ed in its upright pos 
solvent well, after which the mercury wel 
with solvent and dried (\pproximate!] 
clean distilled mercury is then placed in th 
well The assembly Is removed Irom tl 
well, and the lower portion of the capillary 
hod, proper are washed with solv: nt toren 





ial polymer solution. The assembly is th 
ferred to the large test tube in which has be 
sufficient solvent for a convenient references 

level \ small lateral hole cut through the 

level just below the perforated plate peri 
escape of au that might otherwise be trappes 
immersion Air is removed bv gently ra 
lowering the osmometer assembly in the so 
small mirror placed under the | 


large tube pel 
to see when the trapped air has been compl 


eer eA 
moved. The osmometer assembly is nov 

be placed in the constant-temperature bat} 

\ revolving carriage in which 12 osmom« 

be mounted in the constant-temperature b 

mits each osmometer to be brought into 

for measurements of the meniscus heights 


( athe tometel 


5. Membranes 


The following membranes have been used 
fully in the National Bureau of Standards 
eters 

| Vo. 600 Dupont Ce ophai é nonwat 
ety These membranes were conditioned a 
to the method of Carter and Record [€] 

2 Vi er-dr ed regenerated cellulose obtains 
the Svivania Corporation, Fredericksbu 
Conditioned according to the method of Fuos 
Mead {7] 

4 De nitrated collodion membranes In | 
these membranes, we generally followed the 
of Montonna and Jilk [S! It was found suff 
evaporate solvent from the collodion vet 
overnight rather than to use the more « 
method ol \lontonna and ilk. Sufficient 
\fter pouring out the residual solvent, the cell is | distilled mercury is poured on a large (6 


rinsed twice with the polvmer solution and the per- dish to float a 12-cem id) chrome -plated 


4. Filling the Osmometer 


forations in E inspected rpty perforations Into the ring are poured 37 to 40 ml of collod 
mav be filled by qui lv} I yy the assembly and from plastic 1Zel The lid is placed on the pet 
ridding solvent The im : must he performed and the whole assembly is le ftina dust free ho 
quickly to prevent drainage f solvent from the holes nicht unde! slight vacuum to remove solvent 
nthe cerforated plat The assembly is rinsed After overnight evaporation ol solvent the cr 
quickly a third time w solution and finally filled film is submerged in distilled water to loosen t 
The capillary assembly is then placed in position in | from the ring. The ridge is immediately 
the cell, but not firmly seated Gentle suction is ap- the edge of the film, after which the sheet ts] 
plied to the top of the measuring capillary until the | distilled water until final traces of solvent 
meniscus level is visible near the base of the capillary moved The film is then denitrated, follow 
asser bit the capillary assembly is then seated | tonna and Jilk’s technique \ small glass 
firmly. To prevent the solution from spilling over | rod attached to an air-driven motor is used t 
the top of the capillary, a clean absorbent towel is | the ammonium sulfide solution during thi 
placed at the top to absorb excess solution. After | tion procedure 
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ranes altel aenitration § by store 


water to which has been added a 

to prevent bacterial action on the film 

isfactorv method of 1 rane storage is 
freshly prepared bre » acetone 
displaced from the sl ts by washing them 

SSiVve acetone-wate m ot imereas 

concentrations, an { : are final! 


nadetr distilled watel ree 3 tone mtil 


6. Performance 


parison ol the relative speeds In attainment 
‘ equilibrium for th NBS and the Zimm 
ers was made with a solution of polystyrene 
ne The results of several measurements 


in figure 3. The f the total mem 


‘ 
of the Zimm NBS osmometers is 
| 


I ition volumes ts 


nately Ss and o th sO 
ne ratio ot thr rates ol approach to equuil hy 
the Zimm to NBS osmometers was 3.2 

rly 


branes mM these osmometers were probal 


Il 
ial in then perme ibilitv to solvent because 
e cut from the same sheet and were prepared 


tical experimental conditions It 


hat the Zimm osmometer is faster than the 
the ¢« ral more area and the 


mometer 
orientation of th rie anes more tha 


ating for the differes l volume The 


membranes facilitate mixing of solvent and 


by convection, whereas the horizontal mem 

mit convection only when the approach to 
equilibrium is from lower to higher pressures 
lations with time of the final equilibrium 
pressures amount to approximately 0.02 
s can be attributed to bath-temp rature 


s ‘“\fembrane asvimetry corrections 
| same magnituce Because of these small 


rhe 
es nm the osmott ssures no corrections 





for membrane asymmetry were made. Table 1 
shows the duplicability of results with NBS osmom- 
eters for polystyrene fractions in toluene solution 
Figure 4 shows an assembled osmometer and parts 
of the completely disassembled instrument 

The instrument successfully 
P.d Flory and associates [9, 10, 11] 


has been used by 


7. Discussion 


\dvantages of the osmometer are 
Rugged and simple construction 
Rapid assembly, rinsing, and filling of cell 
Assembly and filling time is approximately six times 
as fast with the NBS osmometer as with the Zimm 
instrument 
Minimum possibility of membrane drving 

t. Small volume of cell (1.2 ml 

5. Can be adequately tightened (membrane be- 
tween perforated plate and lapped surface of cell 
when using hvdrocarbon which 
considerable membrane shrinkage 
6. Reduced possibility of entrapment of air in cell 
7. The 


hence 


especially solvents, 


cnuiuse 


small, 
used simultaneously 


complete osmometer assembly is 


a number of them can be 
for a complete osmotic pressure determination 
§. Aeme thread on this instrument has 
angle of 29.5°, a better angle 


pressure 
pressure than the 
type thread, whose angle is HU 
9. Ease of soaking cell in large 
to minimize adsorption of polymer 
Disadvantages of the NBS osmometer are 
|. Precise meniscus adjustment is more difficult 
with the Zimm osmometet However, this is 
when the instrument is being used for 
static equilibrium measurements 
Two to three 


volume of solv ent 


than 
not critical 

4 sometimes necessary fo! 
equilibrium to develop, although this will 
depend primarily on the initial and permea- 


davs are 
osmot 


head 


bility of the membrane. The 
membrane permits equalization of concentrat 
by diffusion when the osmotic equilibriu 
proached from Osmometers wit 
branes in a vertical plane have, in addition 
tive mixing 

It has been the experience here that of 1 
types of membranes used for 
measurements, denitrated collodion is ¢ 
superior in performance. Never-dried cellu 
‘faster’? membrane but is permeable to low n 
weight polymer particles Number 600 
cellophane is ‘“‘slower”’ than denitrated collod 
satisfactorily when 


use of a h 


abov ¢ 


osmotic 


ean be used determin 


osmotic pressure of low molecular weight x 


3. Oceasional membrane buckling makes i 
to make dynamic measurements, as ts the cas 
osmometel! that supports the membrane on oT 
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Low Even Configurations of the First Spectrum of 


Molybdenum (Mo!) 


Richard E. Trees and Marion M. Harvey 


rhe analvsis of the first spectrum ol 
000 
w odd levels that have bee 


of approximat 
evels and 190 ne 


belong to the 


additional lines 
three low configurations 

lished leveis found earlier by C. ¢ 
Assignment of the terms to the 
retical calculats 
168 em between theory 
and detailed form, whicl y 
rhe theoretic calculation assumes LS-co 


tween the thre« nfig 


evel 
Kiess are 
three lov 
()ver-all agreement ts 

and experiment 
mav heip it ippivil 
stati nteractior 


iratior 


of configuratio raction Is obtains 


culties In ass m to ; 
use of the 


It 


Introduction and Summary of Results 


on Mot was started at the National Bureau 
dards more than 30 vears ago and has con- 
ntermittently here ever 
escription of the 
ograms obtained 


Recently, a 
made from 
this period with NBS 
This paper has a two-fold purpose 
eport preliminary results for the analvsis of 
even configurations from this latest deserip- 
\Lo 1, and (2) to describe the theoretical cal- 
mn of the term values of these configurations 
The molybdenum atom has 42 
the outer six give rise to the first spectrum 
electrons arrange themselves into the three 
irations, 4d* 5s", 4d° 5s, and 4d° giving rise to 
n terms, and into other configurations having 
even and odd terms. Previous work on the 
s of Mor has been published by Kiess [1] 
n }2}, Meggers and Kiess 3] and Catalan and 
aga [4]. The last paper the most 
te published analysis, which consists of 119 
and septet levels; it includes the a‘S ground 
nd all the low quintet terms, except a F and 
els a Dp and c 2B) 
he present investigation 44 new triplet and 
levels of the low even configurations have 
ientified. We are in indebted to C. C. Kiess 
mission to use some of his unpublished levels, 
include the a°F term and the levels c°D,, 
*Ds;, a F,, b°D,, and 6 °F There are now 
19 levels of 8 terms of the 4d‘ 5s? configuration, 


since 


spectrum Was 


ovel 


ographs 


electrons of 


contains 


racket 


the 


49 levels of 18 terms of the 4d°5 ana 
9 levels of 3 terms of the 4d \ com- 
plete list of the known levels of these configurations 
is given in table 1 The the 
table contiguration designation mY 
value, q value In 
table 1 the levels known prior to the present investi- 


conhguration 
contiguration 


successive columns of 
give the 
level 


term 
interval, and observed 
starred 

SOT 
2000 from the 
Slater first-order 
by Laport ana 


fatlion are 

2 Ih Mot ure 
positions calculated with 
theorv [5] from the 
Platt {6] for the d* (and d 
configurations and by Catalin and Antunes [7] and 
by Laporte 8] for the d \ theoretical 
calculation shows that these deviations can be largely 
explained by inclusion of the effect of configuration 
interaction the thre configurations 
This calculation also helped In) identifving eeortain 
levels and in predicting the locations of new terms 

The method of treating the second-order effects of 
configuration interaction has been outlined by Con 
don [10 Calculations of the matrix elements of 
configuration interaction for configurations with d 
Ufford [11] and 
comparisons with experi 
made in spectra with 3d and 4s 

Ufford’s caleulation for Zriu 
only detailed comparison be- 
experiment in with 4d 
and 5s this calculation that the 
effects of configuration interaction are very strong 


observed term values more 
than 
the 


riven 


cm 
formulas 


conhguration 


amony lowest 


and s-e le trons have bee n made by 
12) Several 


been 


by Racah 
ment 
electrons [13 to 16 
11] furnishes the 
tween theorv and 


electrons: 


hav ¢ 


sper tra 
shows 


Aln t ‘ 
wl ! 





1 tebe, 
20130 
MVPRI 
PORS0 
LAT 
20607 
994 


LT 
- Sif 


20030 
20050 


k 


‘ 


in Zru, just asin Mor. The spectra with 4 
troms are thus favorable for the study of configu 
interaction in contrast to spectra with 3d ¢ 

where effects of configuration interaction are po 


pronounced Mot 1s especially favorable bee 


its configurations eive rise to a large nun 
terms: nearly half of these already have bee 
experimentally, so that the theory can be test 
many instances 

A study of Mor also furnishes information 
polarization effects A major fact is tha 
EAL 1) correction, found by Trees to apply 
3¢°48 configuration of Fer 16] 1s shown 
applicable also in the 4d°5s configuration of 
The L(L+1) correction is an empirical o1 
corresponds rather closelv to the difference bi 
theory and experiment in the configuratior 
id/- and 4s-electrors that have been compare 
with theory [15 to 18] It represents, therefor 
part of the polarization energy that cannot 
counted for by choosing most favorable values 
radial integrals in the Slater theory 

Racah has pointed out [17] that the validit 
correction of form L(1+1) in d" (or d"s) cor 
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raat thre polarization When comptet 
t as the electro terms with 

teraction. is Slater first term desicna 
The fact that ‘ Dtaim g | ac levels wer 


LiL 


the stre 


3. Limitations Imposed by Magnetic 
Interactions 
LS 


Application of Theory to Analysis of Data 


Slater theorv. extend 
on interaction, has 
nt in many cases, dul often 

experimental anaivs l the theore 

on, the Slater integrals ar regarded as 

rameters and are evaluates to obtam 

nt with the know1 n the spe Mae 


im number of terms m be known expe} r , 
is niso show 
before the calculation e¢: be starter and 7 ‘ 
ana pred f 

nimum is larger the more parameters t 
evaluated in the theory In practice 


minimum numb of terms shou 


and tor ¢ 


experimentally at naccuractes 
will tend to compensal each other 
theoretical estimate & 1 positions ca 


ore accurate by ap hy c the L(L 


to the Slater theor ‘ ising this cor 


5 isually possible to Dl hit t he positions oO 


thin limits of 100 cn so that the search ( 
onsicderi 


iles 


levels can be concentrated to a oarrOwel between 26400 and 27 
of wave numbers than is usually possibl levels belonging to fou 
theoretical caleulation, including configura — Be i a 

. Shee ies ae S 0 t between levels, some ! i hnimments ¢ 


detail in section 4 g sharing 


J=4 


erac ) s «is ssc " 
wtion Is discu ad sin terms are me 
between the vel . and 


the g-sums 
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The mixing of levels due to magnetic interaction | observed terms that enter into the 3X3 > n 
affects the way thev combine, so that the combina- The last three observed values were correctec 
tion properties also indicate qualitatively the pres- | effects of configuration interaction before 


ence of the magnetic interactions. Thus the 6 °F; | in the analysis 
and 4 °G. levels and also the 6°F, and 6 °G, levels The value G,=1,795 is taken as one-sixth 1 
combine in a verv similar manner with most of the | ration between the a’S and a°S terms. TT] 
same odd levels, and the a'S, term combines with | terms are usually well separated from oth: 
most of the levels with which the a *P, level combines that can interact with them magnetically 
Mlanv combinations with Z-values differing bv | magnetic interactions present will have a n 
2? or 3 and milutiplic ities differimg bv 1 or 2 were effect on the value of GQ, because of the lare« 
observed with the few well-identified odd terms that | separation of the *S and °S terms. It is a 
are known Little has been done tow ard HsSsizgnipy that Gy, values obtained from the separations 
L- and S-values to the majority of the odd levels. | of terms based on the same d° parent will | 
When that phase of the analysis is carried out, more | sistent in the absence of configuration inti 
assistance may be obtained in identifving the even | and magnetic effects [16] 
levels by considering combination properties. It is The 10 observed values are then used in 
probable that the analvsis of the odd levels will be squares calculation to get the best values of ra | 
magnetic and | and @. It should be pointed out that if onl 
with seniority number 5 in the d@° parent ter 
been selected, a could not have been evaluat 
4. Calculated Term Values cause its effect could be absorbed in the valu 
B, and ¢ The details are closely similar to 
d°«%* | pointed out for the terms with semority numb 
the d* and d® configurations [18]. Six of 
terms used in the parameter evaluation have pa 
terms with senioritv number 5, and four have pa 
terms with seniority number 3 
In table 2 the term values are calculated wit] 
the L(L+1) correction in the two columns hea 
1); the mean deviation between theory and ex] 
ment is 416 em In the columns headed 
the calculation is given with the L(+-1) correct 
and the mean deviation is reduced to +170 en 
The latter agreement 1s probably as rood as cal 
expected when magnetic mteractions are neglect 


as the discussion in Appendix 2 indicates 


complicated by the presence ol 


configuration interactions 


The matrices for the energy of terms in the d 
and d‘s* configurations are given in Appendix | 
The elements are simple multiples of quantities 
tabulated by Racah [12]; additional details are 
given in reference [15]. The classification of terms 
in a configuration d" must be amplified to distinguish 
between terms with the same S- and Z-values, and 
Racah’s classification in terms of seniority number 
has been used in setting up the matrices of Appendix 
l the prefixed subscripts indicate the seniority 
numbet A fact of importance in understanding 
this classification is that a term in the d@° s configura- 
tion is usually characterized by a single seniority 
number that differentiates terms with the same 
S- and L-values. Terms of the same S- and L- PaBLe 2.— Term values in 4d° 5s configuration of 
values in the d‘ s* and d° configurations are not char- 
acterized by a single senioritv number because there 
is a nonzero, nondiagonal element in that part of 
the matrix of Appendix | that refers to the single 
configuration being considered. The L(1+1) cor- 
rection [16] has not been included in these matrices; 
to include ita term aL(L-+-1) must be added to each 
element in the main diagonal 

The method used to evaluate the parameters i 
considerably simpler than the usual method [13]; i: 
has been given in detail with the idea that it may 
help in future applications of the theory to the analv- 
sis of experimental data. Although the procedure is 
probably generally applicable, it is justified chiefly by 
the good over-all agreement obtained in the final 
result 


4.1. Radial Parameters for the 4d° 5s Configuration 


In the d° s configuration the five parameters, A, P, 
(’, G,, and a, must be evaluated. This is done with 
the seven observed levels that are not perturbed by is assumed that the experimentally observed t 


. . of e matr nondiagonal elements of t 
configuration interaction, and the three additional A. from considerations similar to th 
. - “— rected observed values are then the diagonal element 
*A paper by N. Rosenzweig hvs eV 1952 has just appeared lead to the assumed eigenvalues The correction 
hat gives the d*—d's matrix elemer yme of his elen Ter in sign fror assumed for the interaction parameter H 
i f phase cc j t much different from the final evaluation H 357 


ted terms in the analysis will not reduce t! 


rt 
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arameters for the 4d'5s* and 4d" Configura- 
ions and Positions of Unperturbed Terms 


isually assumed that the parameters are equal 
onfigurations of the same atom when they are 
by radial integrals having electrons with the 
and d-values; this is the assumption made by 
11}. To obtain the full agreement possible 
e L(L+1) correction, it Is necessary to assume 
he parameters are independent when they 
n different configurations. To reduce the 
of parameters that must be evaluated, it 
otherwise have been preferable to use the 
sual assumption 
wo approximations have been presented in 
ce [13] 
the three terms ¢ °D, ¢°H, and ¢ ‘Il are known 
configuration; these are the terms that enter 
e 3&3 matrices and are used to evaluate the 
meters A, B, C, and a In the d*s 
on the corresponding terms a°D, a*°H, and 
also known, and these three terms of d'‘ s? 
ised to evaluate the fou corresponding d's 
The method of correction used for the 


IS also 


Considerations relating to 


con- 


eters 
n the d 
n these 
tion the three terms « 
wo independent parameters each of whi h Is a 
B, ( and a 
therefore be 


s configurat ion (see footnote 5 


configurations In 


hosen can be calculated 


two either con- 


combination of parameters A, 
confirmation of the theory can 
ed at this stage of the calculation, since two 
eters are used to explain three observations 
ist-squares calculation is given in table 3. The 
deviation for the d's? configuration is 128 
and the mean deviation for the d° configuration 
cm 


Ve 


obtain values of all four parameters it was 
ssary to make two assumptions. The first of 
s that a=38 in the d‘ s* and d* configurations, 
makes the value of @ agree with the value 
vy determined in the d° s configuration [17, 18] 
se the L(i+1 
the d‘ s* and d* configurations [18], it is hard 
a general procedure for evaluating a@ in these 
rations. In Mo1, however, the fact that a is 
makes errors in evaluating it less important, 
this first assumption should lead to little 


correction does not apply as 


erro! The second assumption made was that the 
ratio of F; to F, (or of B to ¢ 
ratio already determined in the d 
This assumption should also be fairly rood because 
spectra of the Fe 
group of elements [13 to 16] the ratio F,/F, is fairly 
and is usually within the limits 0.068 
despite the fact that the different config- 
urations occur in different atoms and that the as 
sumptions used in the calculations differ slightly 
The value of the ratio already obtained in the d’ s 
configuration of Mo 1 (F,/F 0.0714) agrees with 
the value of the ratio found by U fford 11) am the 
@ and ds configurations of Zriut (F,/F,—0.0715 
this is confirmation of the constaney of this 
ratio in spectra with 4d electrons 

In the columns of table 4 


is the same as the 
s configuration 


In published calculations for the 


constant 
0.003 


some 


] 


headed (1) are given 


the positions of the terms calculated with the param 


eters evaluated in this and section 4.1 The mean 
deviation of 2391 em will be largely explained 
in the following section the effects of contig 
interaction are considered Besides indi 
cating the magnitude of the configuration inte! 
action, the unperturbed positions of the terms de 
termine the configuration parent of the ob 
served terms (whenever either of these is uncertain 


whi 1 


uration 


and 
as outlined in section 4.4 


4.3. Interaction Parameters: Term Values With 
Configuration Interaction 


are required to define the ele 
interaction Two param 
representing between the d 

and between the d° s and d‘ s* configurations 
radial integrals /7/,, single 
functions identical n and / 
values, these parameters therefore 
sidered equal. The third interaction parameter 
Gr, originates in the interaction between the d' 
d's 
parameter G, already determined in the d 
since the two parameters are defined sim 
ilarly With these assumptions, only one param- 
ete! FH, is needed to include the effects of config 
uration interaction in the calculation When more 
terms are known experimentally, it may be possibl 
The 


known at 


| hree 


ments of 


paramete! : 

configuration 
eters interaction 
and d, 
are defined by 
electron 


Ww hose 
wave have 


and were con 
and 
configurations and was assumed equal to the 
config 


uration 


to check these assumptions agreement ob 
tained with the terms that ar 
does not seem to depend very criti ally on the values 
assumed for the radial parameters in the interaction 
elements between cerms of the d°® and either the 
ds or d' s* and the assumptions are 


really approximations for thi 


present 


configurations, 
values of the two pa 
rameters entering into these interactions 

The parameter //, can be evaluated, so that exact 
agreement with any one experimental value is 
obtained by finding linear formulas for the eigen 
matrix [13 the iteration procedure Is 
finding the eigenvector needed in 


Because //, is determined 


values of the 
convenient for 
making this calculation 





P4850 
S060 


33700 


most accurately from that show large determined in this and the two preceding s 

effects of configuration inter: it was evaluated | The mean deviation between theory and exper 
for only such terms The i // dete rmined is 273 em : by omitting the a P term, 1t Is 
from a°G, a*D, a*P, a*t id ! 308, 362, 286 em The 46°F term also shows a large erro 

358, and 350 em™~', respectively The consistency of | may not be magnetic in origin; the source of this 
these values is a very good check on the theory, | is unknown lf this term also is omitted, the 
partr ularly when it is noted that the low value for deviation for the other 27 terms is reduced to 


‘ 


t *P can be explained by consideration of magnetic cm which is the mean deviation most rep! 
effects (Appendix 2 If the term a*P is omitted tive of the over-all agreement obtained 
‘average of the other four terms leads to the value 

357 (1O8H/ 1130 was used in the calculations 


The columns of table 4 headed (2) give the term 


values calculated with configuration interaction 
included, by using the values of the parameters 
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determine experimentally the configuration, or, it 
the case of ds terms, the parent to which a term 
belongs The configuration and th parent term are 
specified theoretically in such cases by naming the 
terms of same S- and L-value, so that the order is 
the same as that of the inp turbed term val 

This method of classification depends on th 
t} 


assumed for 1 parameters in caleulatin 


perturbed positions the terms, and 


terms have unperturbed positions that 


al 
together, there mav be some indeterminacy 
hamming ot terms accord ng to this: ile Kon Instance 


the a°D and 6°D terms are separated by 


and 1 a°*P and terms are separated 


‘ 
o 


v 
oOo} 


cm | ( j ! robeak positions 


ol magnitu 


P . 
Vv iw 


viation of the final calculation 


see eee 
oaaorcolicoic 


Another shortcoming of this method of classif 
cation is thi name of the term may have litth 
physical sign an When the configuration mntet 
action 1 Ci { aominant component ot thre 

this need not be the 
conhgurat nteraction > rong 
compositions of the three lowest 

thre lowest vels ar wiven m 
entatiy amy will be noted 


that | } classificat on Wwe li 1 I pond 


th dominant component I ely iInetior thr 
assignments ! 1) nna ; nl icp a P and h 


would b 


generally ari \ en Inperturbed 

well separated, 3 hie ‘ for ¢°D and ¢ 

Is eXp ected that ny a term trom th dominant 
component ol the « genfunetion depends too critically 
on the values assumed for the radial paramet 


' 


UNPERTURBED PERTURBED 
make this method of classification any mor 


POSITION POSITION 
cant \ determination of th wenvect 
terms would entail considerable add 


this is another disadvar tage ol 





Eff 


best confirmation of the theorv comes from the 
at seven of the eight observed values not used 
rmine the parameters were predicted within 
of twice the mean deviation (that is, within the 
536 cm Several other levels used in 
iting parameters could have been omitted with- 
v loss in accuracy, as the preliminary calcula- 
howed The calculated term values under the 
gs (1) and (2) of table 4 are compared graphi- 
n figure 1 (only those terms that have been 


mentally observed are shown 


4.4. Classification of Terms 


ndication of the importance of configuration This work has been 
ction is obtained from the value of the param- | Section of the National Bureau of Standards. W. F 


i] The value of this parameter, 357, sec \leggers Chief of ihis Sec on has given it his whole- 
C. C. Kiess, who suggested this 


4.3) 1s large compare d to values so far found hearted support 
The largest problem, has riven gene rousl\ ol his Line ina valu- 


earried o he Spectroscopy 


ctra with 3d and 4s electrons 


mW {13)). The strong able advice throughout the course of the work It 


il 172, is found in Ti 
iration interaction in Mo 
mixing that it is not possible In Many cases to interest and expert guidance 
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5. Appendix 1. Matrices of Electrostatic Repulsion 
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} 


} 
taking into account the separations of the possible inter 
levels and the g-values A fairly complete and cor 
estimate of the effects was obtained which indicat 


Appendix 2. Spin-Orbit Interaction netic effects were made to explain the observed int 


The matrix element of the a*D—a'*P interaction is 


1.031 t(d 065 ¢(d agreement was not due to accidental cancellation of 
» However, the magnetic interaction is fairly strong 
for the levels with J=2, and 0.745 times this value for those generally, agreement between theory and experiment 
with J=1 rhe value of the radial parameter for spin-orbit than +100 cm—' would have to be considered accid 
interaction can be determined wit! 
configuration from the over-all separation of the a ‘5D levels 
By including a correction for the effect of configuration inter- 
action, this leads to a value ¢(d* s 715 em! (The value 
of ¢ in the d* configuration cannot be easily determined in 
this way, but indications are that it is considerably smaller 6. References 
If the value 715 em~' is used for the parameter in all config- 
urations, the matrix element between the levels having J/=2 ». C. Kiess, BS Sci. Pap. 19, 113 (1923) 8474 
is 987 cm~', and the element between those with J—1 is 735 ‘ wT Catalin An. Espan. Soc. Fis. Quim. 21, 213 
em~'. Since these values are the order of half the observed ‘ * ) Meggers and CC. Kiess. J Opt. Gee A 
separation between the cenmagtacing pairs of levels ~ Sci. Instr. 12, 417 (1926 ; 
arger part of the observed separation is probably due to the er eae > : ailestaas . ans 
spin-orbit interaction This conclusion is well confirmed by } ~ pr aay — 4.1 ee, am § - 
inspection of the intervals of a'D and a*P and by consid- 5) J. C. Slater ‘Seen Rev “34. 1293 (1929 
eration of the g-values as already indicated in section 3 O. Laporte ‘and IR Platt Phy s. Rev. 61, 305 (1942 
he observed term values in table 4 are weighted averages ) M A Catalan and M T. Antunes, Z. Phys. 102, 432 
of the actual observed levels rhe weighting factor is 2J+1 1936) : P : ani ; 
so that perturbations of the levels with largest J value aré oO Laporte Phys. Rev. 61, 302 (1942 
most important in considering the effects of spin-orbit inter- | G. Racah, Phys. Rev. 62 438 (1942). 


fs accurs » dé 32 ‘ 
1 fair accuracy in the d‘ s certain cases, errors as large as 200 to 300 ecm™'! could r 


action. The considerations of the previous paragraph indi- E. U. Condon, Phys Res 36. 1121 (1930 

cate that the value calculated for a *P should be increased by | |11) C. W. Ufford, Phys. Rev. 44, 732 (1933). 

say 550 cm~'; owing to the 2/-+-1 weighting, the a*D value 21G Racah Phys Rev. 63 367 (1943 

would be decreased by only 3/5 of this value, or by 330 cm=! A Many Phys. Rev "90, 511 (1946). 

The error in the a *P calculation would then be reduced from |} ALA "Schweizer Phys Rev sO ‘1080 (1950 
884 cm! to 334 cm~', while the error for the a*D term 5) R. I Trees Phys Rev 83, 756 (1951 

would be changed from 94 ecm! to 236 cm! A quan- a R iE i Nay ely 84. 1089 (1951 

titative estimate of the effect would require consideration of G Recah Phys. Rev. 85 38] (1952 

the interactions with other levels that are farther away and R. I " nee Phys. Rev 85 322 (1952 

a better estimate of the values of ¢ CG Ra tah. Physi a 16 651 ‘ 1950 a 
The possibility that other large spin-orbit interactions have . . aoe, Aye dest a” 

produced an accidental good agreement has not been fully 

investigated However, qualitative estimates of the mag- 
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Methods of Conjugate Gradients for Solving 
Linear Systems’ 


Magnus R. Hestenes * and Eduard Stiefel 


An iter 


nowns 


itive algorithm is given for 
The solution is 
ral method which als« 
algorithms for 
theory of 


itit give i! U 


case of a very gen 
ire essenvially 


with the 


algorithms 


are made orthogona 


l. Introduction 


of the major problems in machine computa- 
find an effective method of solving a 
of n simultaneous equations in nm unknowns, 

ilarly if nm is large. There is, 
iethod for all problems because the goodness 
method depends to extent upon the 
ilar system to be In judging the 
ess of a method for machine computations, one 
| bear in mind that criteria for a good machine 
d may be different from for a hand 
By a hand method, shall mean one 

a desk calculator may be used. By a 

ne method, we shall mean one in which 
nee-controlled machines are used 
machine method should have 


s to 


of course. no 


some 
solv ed 


those 
we 


the following 
The method should be simple, composed of a 
tion of elementary routines requiring a mini- 
of storage space 
The method should insure rapid convergence 
number of steps required for the solution is 
A method which—if no rounding-off errors 
will vield the solution in a finite number of 
s to be preferred. 
The procedure should be stable with respect 
inding-off errors. If needed, a subroutine 
available to insure this stability. It 
| be possible to diminish rounding-off errors 
repetition of the same routine, starting with 
result the new estimate of the 


be 


as 


on 


Each step should vive information about the 


on and should yield a new and better estimate 


he previous one 

\s many of the original data as possible should 
d during each step of the routine. Special 
ties of the given linear system—such as having 
vanishing coefficients—should be preserved. 
example, in the Gauss elimination § special 
ties of this type may be destroyed.) 

ur opinion there are two methods that best fit 


criteria, namely, (a) the Gauss elimination 


contract wit! 


k was performed on a Nat 1 Bureau of Standards 
! by the 


of California at Los Angeles, and was sponsored (in part 
il Research, United States Navy 
Bureau of Standards and Univ 
ty of California at Los Angeles 
Zurich, Switzerland 


os Angeles 
echnische 


California at I 
Fidgendssische J 


409 


polynomials and continued fr 


m A 


nowt! 


otn near equations 
that this 
eluminatior 


methor 
(C;aussiar 
; 


psoid 


actions 


dimensional « 


al rn 


method: (b) the conjugate cradient method presented 
in the present monograph 

There many variations of the elimination 
method, just as there many variations of the 
conjugate gradient method here presented. In the 
present paper if will be shown that both methods 
are special cases of a method that we call the method 
of conjugate directions This enables one to com- 
pare the two methods from a theoretical point of 
view 

In our opinion, the conjugate gradient method is 
superior the elimination method machine 
method. Our reasons can be stated as follows 

(a) Like the Gauss elimination method, the method 
of conjugate gradients gives the solution in n steps if 


are 


ure 


to 


as a 


no rounding-off error occurs 
b) The conjugate gradient method is simpler to 
code and requires less storage space 
c) The given matrix is unaltered during the proc- 
so that a maximum of the original data is used 
The advantage of having many zeros in the matrix 
Is preserved The method is, therefore especially 
suited to handle linear systems arising from difference 
equations approximating boundary value problems 
(d) At each step estimate of the solution is 
given, which is an improvement over the one given in 
the preceding step 
(e) At any start by a very 
simple device, keeping the estimate last obtained as 
the initial estimate 


ess, 


an 


step one can anew 


In the present paper, the conjugate gradient rou- 
tines are developed for the symmetric and non- 
symmetric cases. The principal results are described 
in section 3. For most of the theoretical considera- 
tions, we restrict ourselves to the positive definite 
symmetric case. No generality is lost thereby. We 
deal only with real matrices. The extension to 
complex matrices is simple 

The method of conjugate gradients was developed 
independently by E. Stiefel of the Institute of Applied 
Mathematics at Zurich and by M. R. Hestenes with 
the cooperation of J. B. Rosser, G. Forsythe, and 
L.. Paige of the Institute for Numerical Analysis, 
National Bureau of Standards. The present account 
was prepared jointly by M. R. Hestenes and E 
Stiefel during the latter’s stay at the National Bureau 
of Standards. The first papers on this method were 
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ven by E. Stiefel 
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Rosset at a 
Recently, C 


based 


' 
‘ 
method were given by E 


Symposium ° 
developed a closely 
on his earlhiet eigenval lt 
Examples and numerical the 

been by R Haves, | Hochstrasser 


Lanczos * 
routine paper on 
problem tests of 
method hi 


and M. St 
2. Notations and Terminology 


paves Wwe shall bye con 


a svstem of lineat 


Throughout the following 


cerned with the probl m of solving 


form 

coefficients (a 

and h . A 
Its a 


denote the transpose of A by 


The si writte the vectol 


equations ie 
A } Here A ts the 
rand f the vectors 
It is assumed that A is 


exists Wi 


matri 
are 
nonsing erse A 
the retore 

Giiven 


two vectors and y 


t he vectol 


3 the t az 


then sum 


and a 


The 


/ / / 


where fis a st alar sum 


is theu will be denoted 


by 


fy states that fer all 


rey 


satisfy the 


relation 


matrix A 


Ay 


that is, if A=A*, then A is said to be 
A matrix A 1s said to be pes tive definite 


whenever 20 lf (x. Aw) >0 for 


If a,,=a 
symmetr i 


in case (r,Az () 


Lancs 
NAML Report 

C. Lanesos, At 
linear differential anc 
Proceedings f Second 


Machinery, Cambridge 


If A 


are said t 


to be nonnegative 


all J 


metric 


then A 
then two vectors and » 
A orthogonal if the relation 
Ag j 0 holds This is an exte nsion of ti 
gonality relation (z 7] ) 

By an eivenvalue of a matrix 
X such that Ay 
called a corresponding ergenvector 


L'nless otherwise expressly stated the n 


Is Said 


imate oF 


A is meant a 
Ay has a solution yx 


concerned, will be assy 
definite Clearly mn 
generality is caused thereby from a theoret 

because the system Ar k is equ 
the svstem Bz / where B A*A, Il A*i 

numerical point of view, the two svstems a 
rounding-ofl that o 
product A*A. Our applications 
do not involve the 


with which we are 
symmetric and pos tive 


of view 


because of errors 
the 
nonsvinmetric case 
of A*A 
Tm the sequel we shall not have occasion to 
rT] particular vector Acco 
we may use subs« ripts to distinguish vectors 
Thus a will the 
the vector (2 
a symbol is to be interpreted as a componen 
interpret 


ent 
joining 
comp 


coordinate ot a 
of components denot« 


J J and Y 


call attention to this fact unless th 

evident from the context 

T he solution of the 

h: that is, Ah=h If zis an estimate of 4 

ence 7 h Ar will be ealled the 
estimate of 4 The quantity 

The 


error vector ol 7, as an 


systen Ag k will be 
res dual 
will be 
r will be 


of h 


squared residual vector fh 


estimate 


3. Method of Conjugate Gradients (cq 
Method) 


The present section w ill be devoted to a des 
of a method of solving a system of linear e 
A jh This method will be ealled the « 
grod ent method or. more briefly, the eg-method 
reasons which will unfold from the theory dé 
in late For the moment, we shall 
ourse lve sto collecting in one plac ec the basi 1Orl 
upon which the method to des 
briefly how these formulas are used 

The iterative 
terminates in at steps if no rounding 
encountered Starting with an 
of the solution A, one determines su 
sively new estimates Zo, 71, Ze, of A, the estu 
than 2x, At 
is computed 


sections 
Is based and 
eg-method Is an method 
most 7 
errors are 


estimate a 


each sti 
Normally 


foo 


3 being closer to A 
residual r,=k—Ar 
vector can be used as a measure of the 
of the estimate 2 However, this measure ts 
reliable one because, as will be seen in sect 
it is possible to construct cases in which the 
residual |r increases at each step except 
last while the length of the error vector 
decreases monotonically. If no rounding-ol 

is encountered, one will reach an estimate 2 

at which r,=0. This estimate is the desire ( 
tionh. Normally, m=n. However, since roun 
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rs alwavs occur exc inder vem nus 
stances, the estimates 


ition A but will be 


veneral will not 

rood approximation 
residual is I ree one may cont 
iteration to obt { estimates ¢ 


rience Ina 


ate that | 
iblv better than 


equentl 

should ho 

bevond hould 
mat 

oO as to dimin 

As a matte! 
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where RA R’(d,) are determined by the iteration | Next select a direction p conjugate to p 


3:6) and that is, such that 


Re= FP 
Pisr,<p 0 7=0,1, 
R 

In a sense the cd-method is not precise, in 

P formulas are given for the computation of th 

tlONS Po, Pr, . Various formulas can b: 

with A=A In this connection, it is of interest to | each leading to a special method. The 
observe that if m=n, the determinant of A is given 3:1f) leads to the cg-method. It will be 
by the formula section 12 that the case in which the p’s are o 

by an A-orthogonalization of the basic 

1,0, , 0), (0,1,0, . ; leads essent 

det 7 the Gauss elimination method 
The basic properties of the ed-method are g 
The eg-method can be extended to the case in the following theorems 

whi h Aisa ge neral nonsymmetric and nonsingular Theorem 4:1 The direction rectors Po, Pi; 
matrix In this case one replaces eq (3:1) by the set mutually conjugate The residual vector r, is ort 
tO Po, Pr, > * +5 P The inner product Of peu 
of the residuals ro. r,. _r, 18 the same That 


The scalar a, can be given by the formula 


p / 


a; 
p Ap 


mn place of (4: 1a). 
ws Equation (4:3a) follows from 
This system is discussed in section 10 we find that 


4. Method of Conjugate Directions (cd- PsP Py,? 
1 
Method) If ; k we have, by (4: 1a), (pereg1) =O Moreov: 
by (4:3a) (p,,resi)= (pyre), Yk). Equations (4 


yi k+1 

of a general method. which we shall call the method and pH. ye from these relations The forn 
of conjugate directions or more briefly the cd-method. 4:4) follows from (4:3c) and (4: 1a). 

In this method, the vectors po, p,, . . . are selected 

to be mutually conjugate but have no further restric- 

tions. It consists of the following routine: a aaatl Rie. nd al 

Initial step. Select an estimate zs, of A (the solu- en | ta vectors Po,P~i, - - « 18 Independent of thes 

esl s 


tion), compute the residual r>=k— Ax, and choose a Theorem 4:2. The cd-method is an m-step metl 
seoremn t€ Cb Me tito is adi i-STép he 


direction Po : (msn) in the sense that at the mth ste Pp the estimate 
General routine Having obtained the estimate . th, desived solution | 
. . is desires SO hi 
x, of A, the residual r,—k— Ar, and the direction Pu Fe “ee be tl f , int uch that 2 
. ; yr le . » Sr eee anne 
compute the new estimate a and its residual 7 . "edhe an os Se _ _ a 
by the formulas in the subspace spanned by po, - - -, Pm-1 
: m Sn, since the vectors po,p,;, - - - are linearly in 
pendent We may, accordingly, choose sca 
@, * * +, G@m—, Such that 


The cg-method can be considered as a special case 


\s a consequence of (4:4) the estimates z,,2 
of h can be computed without computing the res 
uals ror), - + -, provided that the choice of 


Yo = Ap Po 
Hence, 
4:le 


This method was presented from a different point of view by Fox, Huskey, and Moreoy er, 
Wilf inson on p. 149 of a paper entitled “ Notes on the solution of algebraic linear 
sim. taneous equations,’’ Quarterly Journal of echanics and Applied Mathe 
matics ¢. 2, 140-173 (1948 : k Ag 
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iting the inner product 
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«l. In order to establish this result, we intro- 
he function 
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Geometrically, the equation f(s const. defines 
an ellipsoid of dimepsion n ] The pomt at which 
f(a) has a minimum is the center of the ellipsoid and 
is the solution of Ar=k. The i-dimensional plane 
P,, described in the last theorem, cuts the ellipsoid 
f(a)=f(a)) in an ellipsoid £; of dimension 7—1! 
unless F; is the pomt J itself In the eg-method 
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Equation 5:7b) is obtained by eliminating 7 


and r,; from the equations 


In order to prove 
from the equations 


follows similarly 
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and Ap 


5:8b eliminate Ap 


Equation (5:8a) holds sines 
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roots of A. 
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6. Properties of the Estimates x; of A in the 
cg-Method 


Let , the estimates of h 
obtained by applying the eg-method. Let ro, 7 

r_» =O be the corresponding residuals and po, 

Pr, , Pn the direction vectors used. The pres- 

ent section will be devoted to the study of the prop- 

erties of the points 2», 2 7 As a first result 
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rder Lo establish 6:5 observe that d 5: . is the point mn ‘af whose distance from the solution h is 
the least It lies on the line J J beyor dx, Voreorer. 


In order to establish (6:9 » use the 
formula 


and T we obtain (6:5 
6:6) and 
result establishes the cg-method as a me thod 


which holds for all values of @ in view of the fact that 


x; minimizes f(r) on P Setting a= f(s 


essive approximations and justifies the pro- | We have 


ol stopping the algorithm before the final- 
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/ 


we obtain (6:10) from (6:9) and 
As a further result we have 
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this end observe first that the Thus it is seen that we proceed towards the solu 
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and using (5:6a The result ts 
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r, be the @ -dimensional plan through 2 
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Ap 


This plane contains the points J r,, and hence 


the solution / 

Theorem 6:7 The gradient of the function f(x) ata 
n the plane T sa scalar m ulti ple of the vector p 

The gradient of f(x) at x; is the vector The 
gradient g, of f(x) at 2, in m, is the orthogonal projec- 
The pow tion of r, in the plane x Hence gq, is of the form 


6:2). we obtain (6:7 The points lie in the 


ited order, since f(z At: 
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Pp accordingly, is a scalar m iltipl of the gradient q 
ot f(r) at 
In view of the 
seen that the 
Is an appropriate name tor the 
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direction Pp of the gradient of f(x) at a obtaining 
a minimum value of f(z) at Since the solution h 
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to p 
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as Was to be proved 
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result obtained 
name method of conjugate gradients 
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lies in 
Accordingly in the next step, we relax in the direc- 
tion p of the gradient of f(r) in wr, ata obtaining the 
point z, at which f(z The problem is then 
reduced to relaxing f(z conjugate to 
Po and p At the next 
and soon The dimens onality of the space in 
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step Accordingly, after at 
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is least 
im the plane T 
step the gradient in 7, in x 
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which 


unity at most 7 


‘ ach 


steps thre desired solution 


7. Properties of the Estimates x; of A in the 
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Inn the ey-ni thod there is a second set of estimates 
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Formula (7:3b) is 
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conseq l¢ 
To prove > ore 
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Theorem 7:2 The point T, given by 
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r, at which the square / residual | 
This minimum value 
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The first statement follows from (7:3b 
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the form 
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show 


The residual 
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The first statement follows from 
follows from theorem 6:2, since 7 
7:2) the point J, lies in the line segment joining 
i to zr The distance Irom h to T exceeds thre 


distance from fh to 7 It follows that 


from 7, to Z the distance from f/ ts 
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this result we use (5 and | “! sah aia 


as we move 


8. Propagation of Rounding-Off Errors in 
the cg-Method 
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‘ tf r this relat the vectors 
‘Oa The eq (7:6b follows from (7:6a method the 


st statement follows from (5:12 


the applications to linear svstems arising from 


ee equations approximating boundary 
can be taken as a measure of the 
The Sinaile rv is, the smoother 


Lo prove this re lit convenient 
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( nad l 
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8.1. Basic propagation formulas 


In this part we derive simple formulas showing 
how errors in scalar products of the type 


Apy..p 8:3 


are propagated during the next step of the computa- 
tion. From (8:le) follows 


Inserting 


y 
to the first and third terms gives 
a PisAp h a Ap ‘Po; (S:4 


which by becomes 


p 


This is our jirst propagation formula 


Using (S:le) again, 
Ap WL Ap jy 


Inserting (7:le) in the first term, 


Ap,,p 


But in view of (8:1b) and (S:1d 


a h Ap . 
Therefore, 


Apu p 


This is our second propagati n formula 
Putting (8:5) and (8:8) together yields the third 
and fourth propagation formulas 


Ap,.p 


which can be written in the alternate form 


8:10b) 


by virtue of (8:1b) and (8:ld). Each of these 
gation formulas, and in particular the simpk 

las (8:9), can be used to check whether nonvan)shj 
products (8:3) are due to normal rounding-off ' 
or to errors of the computer. The formulas (8 
have the following meaning. If we build th: 
metric matrix P having the elements (Ap,,7 
left side of (8:10b) is the ratio of two consi 
elements in the same line, one located in thy 
diagonal and one on its right hand sid 
formula (8:10b) gives the change of this ratio 
go down the main diagonal 


8.2. A Stability Condition 


Even if the scalar products (8:2) are not al 
so that the vectors po, Pi, - + +, Pa-1 are not ex; 
conjugate, we may use these vectors for sob 
Az=k in the following way. The solution A m: 
written in the form 


+ dop +a P H. + a Pp 


Taking the scalar product with Ap;, we obtai 


Ap;,p,)a, = (h, Ap Ah,p 


The system Az=k may be replaced by this lin 
system for a),- + -,4, Therefore, because 
rounding-off errors we have certainly not soly 
the given system exactly, but we have reache 
more modest goal, namely, we have transformed t 
given system into the system (8:12), which has 
dominating main diagonal if rounding-off errors ha 
not accumulated too fast. The cg-algorithm 
an approximate solution 


h’ Tot GoPot + er An-ipy 


A comparison of (8:11) and (8:13) shows that 
number a, computed during the cg-process is 
approximate value of a, 

In order to have a dominating main diagona 
the matrix of the system (8:12) the quotients 


Ap,,p ‘tk 
App 
must be small. In particular this must be true 
k=i+1. In this special case we learn from (8 
that increasing numbers do, 4,,- during th 
process lead to accumulation of rounding-off errors 
because then these quotients increase also. W 
have accordingly the following stability conditior 
The larger the ratios a,/a the more rapid 
rounding-off errors accumulate 
A more elaborate discussion of the general quote 
(8:14) gives essentially the same result 
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nvectors as a coordinate system 


theorem 5:5, the scalars a, lie on the range 


Amin, Amax are the least and the largest eigen- 
sofA. Accordingly, the ratio p= XAmax/Amin IS aN 
bound of the critical ratio a,/a,_,, which deter- 
the stability of the process. When p is near 
hat is, when A is near a multiple of the identity, 
g-method is relatively stable. It will be shown 
tion 18 that examples can be constructed in 
the ratios a,/a =} “m—1) are any 
ff preassigned positive numbers. Thus the 
ty may be low. However, this instability can 
mpensated to a certain extent by starting the 
ess With a vector z) whose residual vector r, 
ir to the eigenvector of A corresponding to Amin. 
s event do is near to the upper bound 1/Anm of 
This result is brought out in the following 


rem 


a given symmetric and positive definite matrix A, 
has distinct eigenvalues, there exists always an 
! residual vector ro such that (a;/a;_,;)<1 and henee 
that the algorithm is stable with respect to the 


ation of rounding-off eTTorTs 


order to prove this we introduce the eigenvalues 
A, ALKA Ase 


and we take the corresponding normalized 


Let GQ, a), 


be real numbers not equal to zero and ¢ a small 
tity Then we start with a residual vector 


Mp, Ay €, AoE”, S:l4a 


panding everything in a power series, one finds 


8:14b 


s small enough 


. by-product of such a choice of 7 
approximations of the 


we get by 
eigenvalues of A 


over, it turns out that in this case the successive 


al-vectors rp, 7), ly are approximations of 
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se results suggest the following rule 
CO-process should start with a smooth residual 
that is. one for which ul? tO Amin. 
led. the first estimate can be smoothed by some 
ion process. 

ourse, we may use for this preparing relaxation 
itself, computing the estimates 7 
n section 7. <A simpler method is to modify 
process by setting 6,—0 so that p;—r, and 
ng a, of the form a,—a/u(r where a is a 
onstant in the range 0< a< 1 


ition, is close 


-process 
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8.3. The End-Correction 


The system (8:12) can be solved by ordinary re- 
laxation processes Introducing the numbers a, as 
approximations of the solutions a;, we get the resid- 
uals 


Ap jp ad 


Inasmuch as 7 
we have 


ro,pP ip a Ap p ad Ap Pi) 


is reduced to 


It follows that the residual (8:15 


Ap p a Ap p 
Apip a 
This leads to the correction of a 
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Aa 
Ap ip 


Api.p 
Ap,,p Apap a 
\ first approximation of a; is according] 


Ad... 


that the 
accordingly Lo 


result 
computed 


this 
been 


discuss 


have 


In order Lo 
numbers a 
the formula 


suppose 


Pi! &-19 
PiAp 

Krom S: le it follows that 
riai.P)—0, and therefore this term drops out in 
S:18 In this the correction Aa, depends 
only on the products (Ap,p,) with ick. That is 
to say, that this correction is influenced only by the 
rounding-off after the i-th step If, for 
instance, the rounding-off errors in the last 10 steps 
of a cg-process are small enough to be 
the last 10 values a 
generally, the Aa, decrease rather rapidly 

From (8:18) we learn that in order to have a good 
rounding-off behavior, it is not only 
keep the products (p,,Ap,) ((#k) small, but also to 
satisfy (ry.4.p 0 as well as possible. Therefore 
it may be better to compute the a, from the formulas 
8:19) rather than from (S:1b We see this 
mediately, if we compare (8:19 Rh 
8-19) and (8: le) we have 


theorem 5:5 


case 


errors 


neglected 


Hen ce 


need notto be col rected 


necessary to 


Im- 


with by 


ill-conditioned matrices 
considerably larger 


For where a and b 
become than 1, the 
of the second summand may cause additional errors 
For the same reason, it is at least as important in 


may 
omitting 





thes« cases to use tormul 


for determining 6 


Here the second summand is not directly mad 


anv of the two sets of form ilas for a, and 


‘ 
ero py 
by The onlv orthogonality relations. which are 
directly fulfilled in the scope of exactitude of the 
numerical computation DY the choice of a, and b 


nre the follow inne 


Therefore we have { epre { terms 


ol thre = Si ala produc ts 


h 


From this expression we see that for large 6, and a 
the second and third terms may Cause considerable 
rounding-off errors, which affect also the relation 
p Ap 0. if we use formula (8:1d) for 6 This 
is confirmed by our numerical experiments (sec- 
tion 19 

From a practical point of view, the following 
formula is more advantageous because it avoids the 
computation of all the products (Ap,,p From 
8:1e) follows 


and we have the result 


Aa ae. 8:20 


This formula gives corrections of the a, if, because 
of rounding-off errors, the residual r, is not small 


enough 
8.4. Refinement of the cg—algorithm 


In order to diminish rounding-off errors in the 
orthogonality of the residuals 7, we refine our general 
routine (8:1 After the 7th step in the routine we 
compute (Ap;_;,p,), which should be small. Going 
then to the l)st step we replace a, by a slightly 
different quantity @, chosen so that (r,,r; 0. In 
order to perform this, we may use (8:4), which now 


must be written 


vielding 


Introducing the correction factor 


Ap }/ 


Ap, p 


and taking into account the old value (S:]I 
this can be written in the form 


Continuing in the general routine of the 
we replace 6; by a number 6, in such a wa 
Ap p 0 We use (8:6 which now 1 


written in the form 


The term (7,7; vanishes by virtue of ou 
of a Using (8:7), we see that a a>. an 
R99 


4h d 


Since rounding-off errors occur again, this 
routine can be used in the same way to impro 
results in the (4 l)th step. 

The corrections just described can be Incorpo 


automatically in the general routine by replacing 


formulas (3:1) by the following refinement 


d 


Another quite obvious, but numerically 
laborious method of refinement goes alo 
following lines After finishing the 7th step, e 
a product of the type (Ap,,p,) with / 
replace p,; by 


Ap;,p 
Pil De. 
Apx, p 
The vector Pp. 8 exactly conjugate to p 
method may be used in place of (8:24 
l. , ; ‘ ‘ j 
-—i1—1 It has the disadvantage that a ve 
be corrected and not just 7 umbers, as in (8:24 
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9. Modifications of the cg-Process I. The vector p, can be chosen to be the residual 
vector ry described in section 7. 
he eg-method given by (3:1), the lengths of the In this event we select 
S Po, Pi, - - - are at our disposal. In order to 
ve significant figures in computations, it is d , dis) 
ble to have all the p, of about the same magni- 
In order to aim at this goal, we normalized | The formula (7:2b) for 7,,, becomes 
; by the formulas 


P y ?P } 0). 


other normalizations can be made. In Il. The 
see how this normalization appears, we 
Pp by d PP; In eq 3:1), where d is a scalar i 
This factor d, is not the same as that given D=D> = 
on 8 but plays a similar role. The result is ss 
holds. 

In this event the basic formulas (9:1) take 

simple form 


vector p; can be chosen so that the formula 


ry 


Ary 


Pi, Ap 


Ap 
Pi, Aps) Pi+1 


tions between d, are given by ee ; 
; This result is obtained from Y:] by choosing 
dy d, 
- 0 i 
In this case the formulas (9:5) are very simple and 
d are particularly adaptable to computation. It has 
- oe. the disadvantage that the vectors py may grow con- 
; siderably in length, as can be seen from the relations 
r) is the Rayleigh quotient (4:12). In order 
tablish these relations we use the fact that 7 
are orthogonal. This yields 


However, if ‘floating’ operations are used, this 
should present no difficulty. 
r . . ia III. The vector P; can he chosen to be the correction 
: = to be added to 2X; in the ” ] st relazration. 
ie of .the formula 7 r;—a,Ap,. From the In this event, a,—1 and the formulas (9:1) take 
tion between p, and r,, we find that the form 


r 
h - Arg, pr=> 


lds (9:2) in case 7>0. The formula, when 
lows similarly. 
formulas (9:1) the scalar factor d; is an 
positive number determining the length of 
» case d,—1 is discussed in sections 3 and 5 
owing cases are of interest 
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9:1) and (9:2) by 


These relations are obtained from 
setting a l 
lV The vector p,can he chosen so that a; is the recip- 
rocal of the Rayleigh quotient of 1 
The formulas for a;, 6; and d 


/ 


in (9:1) then become 


a 


Ai 


h a 


a 


do=1, d l- 


the variety of choices 
d,. 


| appears 


This is sufficient to indicate 
that can be made for the scalar factor do, 
For purposes of computation the choice d 
to be the simplest, all things considered 


10. Extensions of the cg-Method 


In the preceding pages we have assumed that the 
matrix A a positive definite symmetric matrix 
The algorithm (3:1) still holds when A is nonnegative 
and symmetric. The will terminate when 
one of the following situations is met 

1) The residual ;, zero. In this event 
a solution of Aa=k, and the problem is solved 

2) The residual r,, is different from zero but 
Apm.P 0, and hence Ap,,=0 Since py=eil i, 
it follows that A7,,—0, where r,, is the residual of the 

Z,, defined in The point 7, 
a point Ar* attains 
In other words, 2 is a least-square 
One should observe that Pm (and hence 
Otherwise. we would have ,. 4b, iPm 
to the fact that is orthogonal to Pn 
fails to minimize the function 


Ag 2(k 


Is 
routine 


is c. wi 


Is 


section 7 


which 


vector 2 
accordingly 
minimum 
solution 
=) 


} its 


at 


oo 
contrary 
The point z, 


7 


gir r r 


for in this event 


tp g(a 2tir, 


In fact, g(a) fails to have a minimum value 

It remains to consider the case when A is a general 
nonsingular matrix. In this event we observe that 
the matrix A*A is symmetric and that the system 
Ar=k is equivalent to the system 


A* Az= Ak. 


to this last system, we obtain 


Q(X» 


10:1 


Applying the eq (3:1 
the following iteration, 
I: 


7 Axo, 


If one does not wish to use any properties 
eg-method in the computation of a, and 6 
the defining relations, since they may be d 
by rounding-off errors, one should use the fi 


(Api,? 
Ap 


In this case the error function f(2) is the { 
hr k—Agr and hence is the squared rv 
It is a simple matter to interpret the results 
above for this new system. 

It should be emphasized that, even though 
of the system (10:2) is equivalent from a the 
point of view to applying the cg-algorithm 
system (10:1), the two methods are not equ 
from a numerical point of view. This follows | 
rounding-off errors in the two methods are 1 
same. The system (10:2) is the better of th 
because at all times one uses the original mat 
instead of the computed matrix A*A, wh 
contain rounding-off errors 

There is a slight generalization of the system 
that is worthy of note 
of selecting a matrix B such that BA is positiv: 
nite and symmetric. The matrix B is necessa 
the form A*//I, where // is positive definit 
symmetric. We can apply the cg-algorithm 
system 
B 


SAX k. 


one obtains the algorithm 


I 


In plac c of 


, 
/ 


Bria, byp;. 


Again the formulas for a, and },, which ar 
directly by the defining relations, are 


(p;, Bry) 
\ Pi, BAp;) 


a 
(Br..,,BAp)) 
(p BA py 
When B=A*, this system reduces to (10:2 
is symmetric and positive definite, the choi 


gives the original cg-algorithm. 
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This generalization consis 


ere 18 a generalization of the ed-algorithm con- 

i which a few remarks should be made. In 

method we select vectors po, . . ., Pp, and 
» In-; Such that 


solution can be obtained by the recursion for 


problem is then reduced to finding the vector 
such that (10:5) holds. We shall show in 
ent that q; is of the form 


q B*p 10: 


B has the property that BA is symmetric and 
ve definite. The algorithm (10:6) is accordingly 
ilent to applying the ed-algorithm to (10:3). 
that q; 1s of the form (10:7), let P be the 

< Whose column vectors are Po, . . ., P»—; and Q 
matrix whose column vectors are do, . . ., Jn 
ondition (10:5) is equivalent to the statement 
he matrix D=(*AP is a diagonal matrix whose 
onal terms are positive Select B so that 


*P. Then D=P*BAP from which we conclude 


j 


n 


A is a positive definite svmmetric matrix, as 
» be proved 

view of the results just obtained, we see that 
gorithm (10:4 St ge neral eg-algorithm 
j linear system. Similarly, the most general 
gorithm is obtained: by (i) selecting a matrix 
h that BA is symmetric and positive definite, 
electing nonzero vectors p » Pa-, such that 


is the mo 


(p;,BAp 0. iF] 


, using the recursion formulas 


Pi, Bro 


Construction of Mutually Conjugate 
Systems 


vas remarked in section 4 the cd-method is not 
ete until a method of constructing a set of 


uly conjugate vectors po, pr, has been 


In the eg-method the choice of the vector 
p; depended on the result obtained in the previous 
step. The vectors po, p;,. . . are accordingly deter- 
mined by the starting point z) and vary with the 
point 2 

Assume again that 
metric matrix. In a 
Pi, « « can be chosen to be independent of the 
starting point. This can be done, for example, by 
starting with a set of n linearly independent vectors 
Un, Uy, . » @ 


riven 


A is a positive definite, svym- 


cd-method the vectors Po: 


and constructing conjugate vectors 
A-orthogeonalization process For 
example, we may use the formulas 


by a successive 


The coefficient a,;,(i >) 


The 


is to be chosen so that py. is 
conjugate to p formula for a;, is evidently 


u, AD 
f 1] 
py, Ap 


Observe that 


Using (11:3) we see that alternately 


AGuP 11:4 
Au,;,p 


As described in section 4 
the solution are 


the successive estimates of 
given by the recursion formula 


a p 
where 


There Is a second 
vectors Po, Pry 


formulas 


the 
recursion 


method of computing 
given by the 
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We have the relations (11 
Au 0 qh 11:8a) 
Ap 11:8b) 
11:S8e) 
0 | 11:8d 


The eq 11:S8a) hold when It] by virtue of 
(11:7¢) and (11:7d). That they hold for other 
values of j<k follows by induction. Equation 
(11:8e) follows from (11:7¢ 

If one selects successively w=7o, W=r 
Un—1=Pn-1, the procedure just described is equivalent 
to the pow bow de described in section 3, in the sense 
that the same estimates Zp, Z,, . . and the same 
direction vectors Po, Pi, - . are obtained. If 
one selects wu k. u Ak, — A" 'k, one 
again obtains the same estimates 2, 2;, .. . as in 
the cg-method with 2,—0. However in this event 
the vectors po, p;, . . . are multiplied by nonzero 
scalar factors On the other hand if one selects 
U, = u a 0), u, 0, 

0.1) the ed- method is equiv: alent to ) the Gauss 
elimination method. This case will be discussed i 
the next section. 


12. Connections With the Gauss Elimina- 
tion Method ” 


In the present section it will be convenient to 
use the range 1, . , 7m plac eoi0,1,...,M l 
a heretofore, except on the notations Jo, 7, . 

, describing the successive estimates of the solution. 
bot «4 se ae be the unit vectors (1,0, . 
os a eee 0,. . ..0,1). These ‘vectors 
will play the role of the vectors wo, =» Geng OF 
section 11. The eq (11:7), together with 11:4) and 
(11:5), yield the recursion formulas 


(12:la 

(12:1b 

(12:1c) 

, o 
As,0) (12:1d) 
Lee TOP i (12:le) 


\Pi, k) 


Ase (12:1f) 


a,= 


These formulas generate mutually conjugate vectors 
Pi, « - +» Px and corresponding estimates 2, . . ., Zp 
of the solution of Ar=k. In particular z, is the 
desired solution. The advantage of this method 
lies in the ease with which the inner products appear- 


{ “ef. Fox, Huskey, and Wilkinson, loc. cit. 


| 
| 


ing in (12:1d) and (12:1f) can be compute 
systematic scheme for carr \ ing out the ¢ omput 
will now be give n. The scheme is that con 
used in elimination. In the presentation th 
now give, extraneous entries will be kept so 
give the reader a clear picture of the results obt 

We begin by writing the matrices &, £@ 
vector & as a single matrix 


Ani GAno a, - Gea 0 


The vector Pr is the vector 1% Se 
k,/a,, is defined by (12:1f). Hence, 


a,p, 


is our first estimate. Observe also that 


Multiplying the first row by aq and subtracti 
result from the ith row (i=2,- - - ,n), we obta 


new matrix 


Pin 


One should observe that (0,42 ,. . .,.4®) is the residu 
of x, By the procedure just described the ith roy 
(¢>1) of the identity matrix has been replaced by 
uf. te second row yielding the vector p, 2». O 
serve also that 

af =(AuF 


a? 
a3; (Apo,é.), 
Hence, 


kq 
I2> Mts P2 


23 
is the next estimate of the solution. Moreover, 


ay . 
“= — (g==3, . . -.%)e 
2 a? ’ os 
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» and sub- 


We ob- 


multiply the 2nd row of 
the result from the ith row 


u 


027... ..f 
elements U;),. . .,U,; 


u®. We have 


vector is the residual of 2 


form the vector u‘ 


ding in this manner, we finally obtain a matrix 


form 


j 
12:4 


lements Pa,** * P define a vector Pi The 
WS py, Pp, are the mutually conjugate 


s defined by the iteration 12:1). At each 


ver the estimate z, of the solution A is given 


formula 


ctor U,- - +, O, G5’, + - +, @; defined by the 
elements in the ith row of (12:4) is the vector 
If we denote by P the matrix whose column 


vectors are Pi, Pe, then the matrix (12:4) 
is the matrix 


ra 


The matrices P*A and P are triangular matrices 
with zeros below the diagonal. The matrix D—=P*AP 
is the diagonal matrix whose diagonal elements are 
ay, a2, a The determinant of P is unity and 


the determinant of A is the product 


a,,a . a 


As was seen in section 4, if we let 


the sequence 


No 


S¢ quene ( 


monotonically general statement 


made for the 


decreases 
can be 
y 

of lengths of the error vectors y h—2 In fact, 
we shall show that this sequence can Increase mono- 
tonically, except for the last step. <A situation of 
this type cannot arise when the « g-process 1s used 

If A is nonsymmetric, the interpretation given 
above must be modified somewhat An analy SIS of 
the method will show that one finds implicitly two 
triangular matrices P and Q such that Q*AP is a 
diagonal matrix. To carry out this process, it may 
be necessary to interchange rows of A By virtue 
of the remarks in section 10, the matrix Q* is of the 
form B*P. The general procedure is therefore equiv- 
alent to application of the above process to the SVs- 
tem (10:5 


13. An Example 


of the solu- 
the error 


In the « e-method the estimates 2»,2 
tion A of Az k have the property that 
vectors ¥ h J y h J are decreased in 
length at each step This property is not enjoved 
by every cd-method In this section we 
an example such that, for the estimates 7 Og 
of the elimmation method, 


construct 


If the order of elimination is changed, this property 
may not be preserved 

The example we shall give is geometrical instead 
of numerical. Start with an (n—1)-dimensional 
ellipsoid £, with center z,—A and with axes of un- 
equal length. Draw a chord C, through 2,, which 
is not orthogonal to an axis of Select a point 
Inn) #2, ON this chord inside £,, and pass a hyper- 
plane P,_,; through z conjugate to C,, that ts, 
parallel to the plane determimed by the midpoints 
of the chords of £, parallel to ¢ Let e, be a unit 
vector normal to P, It is clear that e, is not 
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parallel to ¢ The plane P,_, can be shown to cut | tinued fraction expansions. To develop th 
E,, in an (n—2)-dimensional ellipsoid /,_,; with cen- | first study connections between orthogonal] 
ter at z and with axes of unequal length nomials and n-dimensional geometry 

Next draw a chord ¢ of EF throughs which Let m(X\) be a nonnegative and nonde¢ 
is not orthogonal to an axis of £ ,and which is not | function on the interval 0<A</. The (Ri 
perpendicular to A—z One can then select a | Stieltjes integral 
point z on ¢ which is nearer to Athanz, Let 
P be the hyperplane throug! J conjugate to 
( It intersects E in an (n—3)-dimensional 
ellipsoid £ with center at 2 The axes of E, 
can be shown to be of unequal lengths. Let ¢ be 
a unit vector in P,_, perpendicular to P 

We now repeat the construction made in the last 
paragraph. Select a chord ¢ of k through 
I that is not orthogonal to an axis of EF and that 


then exists for any continuous function / 
O<A<I. We call m(\) a mass distribution « 
positive A-axis. The following two cases m 
distinguished. 
a) The function m(\) has infinitely many 
is not perpendicular to A—a Select 2 on ¢ of increase on OA l . 
nearer to A than z,_», and let P,_; be a plane through (b) There are only a finite number n of po 
; conjugate to ¢ i. ente i man (n—4)- | mecrease In both cases we May constru 
dimensional ellipsoid £ with center at a with orthogonalization of the successive powers | 
axes of unequal lengths. Let « be a unit vector in , A" asset of n+1 orthogonal polynomials 
P and P perpendicular to P Clearly, ¢,, RA).RA). - Rd 
é é,-2 are mutually perpendicular. a 
Proceeding in this manner, we can construct 
1) Chords C,, ¢ , (,, which are mutually 
conjugate. 
2) Planes P P, such that P, is conjugate ROORA)dm(r)=0 i Xk). 
to ¢ The chords C;, , (, lie in P, st 
3) The intersection of the planes P,_;, . . ., P,, The polynomial R,(A 
which cuts £, in a (k—1)-dimensional ellipsoid £ R.0) is’ 
with center z, ohg 
1) The point 7 which is closer to A than 7 


with respect to the mass distribution. One has 


1S of degree i In Cast 
) is a polynomial of degree n having its z 
the n points of increase of m(A In both cases 
zeros of each of the polynomials 14:1) are real 
distinct and located inside the interval (0,/). H 
we may normalize the polynomials so that 


yi 


| 
5) The unit vectors ¢ é», & (with e, in the 


direction of C,), which are mutually orthogonal. 
Let x) be an arbitrary point on (, that is nearer to > 

. Rh 

A than a Select a coordinate system with 7» as the 
origin and with e,, . . ., e, as the axes. In this co- 
ordinate system the elimination method described 
in the last section will yield as successive estimates 


(0 i & se 


The polynomials (14:1) are then uniquely detern 
by the mass distribution. 
During the following investigations we usi 
Gauss mechanical quadrature as a basic tool. It: 
mates have the property that zx; is closer to 7,=h ie dees aa tlie: WE tes « x. cee 
than a if i<in ; oy! T 
As a consequence of the construction just made we 
see that, given a set of mutually conjugate vectors 
Pr, » Ps and a starting point 2», one can always *l 
choose a coordinate system such that the elimination RA)d m(A)= mM, RA mR As 
method will generate the vectors p,, . . ., P» (apart “— 
from scalar factors) and will generate the same esti- 
mates z,. . ., 2, 0fh as the cd-method determined by | whenever R(d) is a polynomial of degree at mos 
these data One needs only to select the origin at an l. In the special case b) the Xz are the abscissas 
x, the vector ¢, parallel to p,, the vector ¢, in the | where m(d) jumps and the m, the correspondir 
plane of p, and p, and perpendicular to e,, the vector jump. 
és in the plane of p,, p2, ps and perpendicular to « In order to establish the duality mentioned 
and ¢,, and so on. This result may have no practi- | title of this section, we construct a positive defin 
cale value, but it does serve to clarify the relation- matrix A having d,, »». - - -. A, as eigenvalu S 
ship between the elimination method and the cd- | jngtance. the diagonal matrix having \, - - +) 
method, and also the relationship between the the main diagonal and zeros elsewhere). Furth 
elimination method and the eg-method. more. if 6. a e. are the normalized 


14. A Duality Between Orthogonal Poly- 
nomials and n-Dimensional Geometry p= Oye, + alg t+ ++ + toner, 


the points 7, _az, deseribed above These esti- 


zeros of R,(d), there exist positive weight coefficier 
m;, Mo, « + «mm, such that, 


vectors of A, we introduce the vector 


4 he . ; . ; aT > . ous ‘rties of « ogonal polynomials used in this chapt 
e method of conjugate gradients is related to | I'he various propertic wthogonal polynomials use 
I h . th f nj ga Sra li found in G. Szegé, Orthogonal Polynomials, American Mathemat 


the theory of orthogonal polynomials and to con- | Colloquium Publications 23 (1939 
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Again we may restrict ourselves to the powers 
A That is, we must show that 


ien have 
ay Ay" 6 / . r + | ’ 14:10 
NM l The vectors Ar 


0,1, l, this has already been verified The 


are linearly independent and will be used as a remaining cas 
inate system. Indeed their determinant is up 
factor a,a 27 * Van det \Monde’s determi- 
A By the correspondence A" dm d A" A / “S/ 14:11 


Ay Ol, + + «8 14:8 


follows in the same manner from Gauss’ integration 
) 


, | formula, since n+k<2n—1 
polynomial of marimal degree n—1 is mapped Theorem 14:3. Let A be 


. | a positive definite sym- 
i vector of the n-dimensional space and a one- f 


metric matrir w th d stinel €1de nvalue s and let } he a 
rrespondence between these polynomials and j yl} ; 
fecio7s Cité 


Ss 1s established The correspondence has the Th 
ie 


‘ . 
tt is not perpend cular to an eigenvector 


‘ a mass distribution mix related to 
ng proper 1¢s ] ve 

CePRCTIO aho ‘ 
em | } l Le 4 the space of polynom als R \ 


é <) l he j ”? the norm 


j 


In order to prove this result let ¢ : 
the normalized eigenvectors of A and let X : 
A, be the corresponding (positive) eigenvalues. The 
R | j R ] vector rp is expressible in the form (14:5 According 
i e(A)-dm(A ; pales 
to our assumption No a vanishes Che desired mass 
distribution can be constructed as a step function 
hed abhor fy that is constant on each of the intervals 0< 
A,</, and having a jump at A, of the amount 
7 m a 0, the number / being anv number greater 
Rd) RR’) dm (x rr’). than A 
J0 We want to emphasize the following property of 
oul correspondence If A and ; are given, we are 
omials corresponding able to establish the corredence without computing 
eigenvalues of A. This follows immediately from the 
sufficient to prove this for the powers  - basic relation (14:8 Moreover, we are able to com- 
r" Let ’, * be two of these powers pute integrals of the tvpe 
Gauss’ formula (14:4) follows 


COTES Pt ndence 


, -] Rr) R’ (+) dm (d), ROY) RON Adm (A 
Adm (r [ dm (x 14:12 


mr — j : where R, R’ are polynomials of maximal degree 

n—1 without constructing the mass distribution. 

14:5). (14:6). and show that this is Indeed, the integrals are equal to the corresponding 
the scalar product (A’ry,A*ry) of the cor- | scalar products (r,r’), (Ar,r’) of the corresponding 
ding vectors vectors, by virtue of theorems 14:1 and 14:2. Finally, 
wem 14:2. Let the space of polynomials R(x the same is true for the construction of the orthog- 
vee <n 1 be metrized by the norm onal polynomials Rr), Ry ROX) because 
j the construction only involves the computation of 

integrals of the type (14:12 The corresponding 


[r r)2rvAdm (r ‘ vectors 7 » 7 Be % build ar orthogonal hasis in 
0 the Kkeuclidian } 8 pace 


jee polynomials R(d),R’(X) corresponding to 15. An Algorithm for Orthogonalization 
In order to obtain the orthogonalization of poly- 
nomials, the following method can be used lor 
U (A) RY (A) Adm (A Ar,r’), 14:9 any three consecutive orthogonal! polynomials the 
recurrence relation holds 


the corresponde nee is isometric with respect lo 
ght function \dm(X) and the metric. determined 


R / f ar R \ 





real numbers and a,#0. Taking 
14:3), we have 


where a,, ¢ d are 
into account the normalization 


Hence 
R d 


This relation can bi 


Riss —R 
aX 


R 


From this equation it is seen by induction that 
P » I Ii 


are polynomials of degree Introdue ng the num- 


a \P \ 


Beginning with R l, we are able to compute by 
(15:5) successively the polynomials P»>=R)=1, Rj, 
P,, R,, P,, -» prov ided that we know the numbers 
a,. b In order to compute them, observe first the 
relation 

ixtk 


15:6 


Indeed this integral is up to a constant factor 


“7 
Ris, -R)P.dmOr 


i this is zero, because the second factor is of 


For k 


degree k Be 


Using (15:5a) and (15:6), we obtain 


P.(r)?dm(d). 


. 


i 
R, »)P nN Adm nN 


0 


Combining this result with the orthogonality of 
R 1 and R,, we see, by 15 5b . that 
™ 


| R(rAdm(r 


Par *~dm(r 


. 


Using (15:6) and (15:5a) 


0 ( RNP; -1(\)\dm(a) +6 (A)?Adm(A). 
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Applying (15:3) to the first term vields 


1 ff 
R IN 2dm r =f P iN 2=~dm 


Gi-1, 


Combining this result with (15:7), we obtain 


| \ dm r 


| 2 i(A)*dm 


The formulas (15:5), 15:7), 15:8), togethe 
P,=1, 6.;=0, completely determine the poly 


Ro, R;, , Me 


16. A New Approach to the cg-Method 
Eigenvalues 


In order to solve the system Ag k. 

residual vector 79> =k— Axy of an initial estimat 
the solution A and establish the correspondenc: 
on A, ry, described in Theorem 14:3. Withou 
puting the mass distribution, the orthogonal 
process of the last section may be carried 0 
15:5), (15:7) and (15:8) with Ry=—1, b 0) 
vectors 7r;, p; corresponding to the polyno 
R,, P; are therefore determined by the recu 
relations 


we comp 


P r.t-b Pp ? } } ( Ap 16 

Multiplication by \ in the domain of polynom 
is mapped by our correspondence into applying 
in the vector space according to (14:11 In fact 


Pi=P(A)ro, T1=—R(A)ro [=0. 1. 


The numbers a,, 6; are computed by (15:7) and 
Using the isometric properties described in theor 


14:1 and 14:2, we find that 


7 


Ap;,p 


The vectors r; are orthogonal, and the p; a 
jugate; the latter result follows from (15:6). Hs 
the basic formulas and properties of the cg-met! 
listed in sections 3 and 5 are established. It rema 
to prove that the method gives the exact solut 
after n steps. If we set 2;,,;—2,;+a,p;, the 
sponding residual is r;,, as follows by inductior 


k—Axi,;=(k—Az,;)—a,;Ap 
For the last residual r, we have 
\P'n,7 i) = 


| 2. Ridm—a, 
e 


[ 2, R.dm=—O. 








ethod 


s \ 


| P \ 





kur basic method reestablishes also the methods 


Lanczos for computing the characteristic poly- 
al of a given matrix A. Indeed the polynomials 
omputed by the recurrence relation (15:5), lead 
ly to the polynomial ?,,(A), which, by 
nition of the correspondence in section 14, is the 
acteristic pe lynomial of 4 prov ided that ry satis- 
the conditions given in theorem 14:3. It may 
membered that orthogonal polynomials build a 
Therefore the 


the basic 


man sequence polynomials 
R R, build a Sturmiar Sequence for the 
ues of the given matria A 
7 correspondence allows us to translat« every 
hod or result in the vector space into an anal- 
s method or result for polynomials and vice 
1. Let us take as an example the smoothing 
ss in section 7 It is easy to show that the 
r Ti; introduc ed in that section corresponds oa 
iomial R,(A) characterized by the following 
ertv: R(X) is the polynomial of degree 7 with 
1 having the least-square integral on (0/ 
ther words, if rp) is given by (14:5), then 
» oF Ry 0d; 2 Rid mipimum 
result may be used to estimate a single eigen- 
of A. In order to compute, for instance, the 
Sst ele nvalue \,, we select ry near to the corres- 
ng eigenvector The first term in the expan- 
pDelIng dominant. the sn allest root of R \ will 
rood approximation of \,, provided that 7 is not 
mall. Hence the last residual vanishes, being 
ogonal to 7p, 7 It follows that x, is 
esired solution 


17. Example, Legendre Polynomials 


known set of orthogonal polynomials vields 
xample of a cg-algorithm. Take, for instance, 
Legendre polynomials. Adapted to the interval 


they satisfv the recurrence relation 


R d), RO ] 


QINR AA 
| l 





This gives the following result, let A be a symmetric 
matrix having the roots of the Legendre polynomial 


R,0X) as eigenvalues, and let 

y aye Oot Apt 
where ¢ . ‘ — are thi normalized eigenvectors of 
A, and ” a m OG m a are the 


weilcht-coefficients for the 
rature with respect to R 


Gauss’ mechanical quad- 
The eg-algorithm applied 


to A wy vields the numbers a,, 6; given by (17:1 
Moreover. 
] 
) h._b h ; } 
; 2 ] 
Hence the residuals decrease during the alogrithm 


It may be worth noting that the Ravleigh quotient 
of r, is 
ry Mr l D l 
/ a a 4 


All residual vectors have the same Ravleigh quotient, 
This that, unlike many other relaxation 
methods, the ¢g-process does not necessarily have 
the tendency of smoothing residuals 
The Chebyshev polynomials vield 
where a,, 6; are constant for 0) 


show S 


an ¢ xample 


18. Continued Fractions 


Suppose that we have given a mass distribution of 
tvpe (b in 14. The function 
m(X) 1s a step function with jumps at O<A Pn 

\,</, the values of the jumps being m,, m - 
Me, respectively It is well known" that the orthog- 
onal polynominals /2)(A), RA), . Rar 
sponding to this mass distribution, can be constructed 
bv expanding the rational function 


as deseribed section 


” ” 
Fn 18:1 
\ d XN \ 
in a continued fraction. The polynominal /?;(A) is 
the denominator of the ith convergent For our 


is to write the continued 


purposes it convenient 
fraction in the form 


The denominators of the convergents are given by 


the recursion formulas 

R / 1rA)R,—eR R 18:3 

However in ordet to 
carried 

The numbers 
that 


with (15:1 
the expansion must be 
virtue of (15:2 

then fIiVel by 15:4 It 


This coincides 
satisfy 14:3), 
so that d ri 1, by 
b, are 


out 


s cleat 


wher 


Let us translate these results into the n-dimensional 
As before. we 
svmmetric matrix A 

Let é;, , é be 


space given by our correspondence 
positive definite 
eigenvalues \,, a 


construct a 
with 


corresponding eigenvectors of unit length and choose. 


aus before 


The eigeny alues are the re¢ iproe als of the squares of 
the semiaxis of the (n—1)-dimensional ellipsoid 
(x, Ag l. The hyperplane, ro, d 0. cuts this 
ellipsoid in an (n—2)-dimensional eltipsoid, £,_», the 
squares of whose semiaxis are given by the reciprocals 
of the zeros of the numerator ¢ h) of F(A 

This follows from the fact that if \) is a number 
such that there is a vector rm +0 orthogonal to 7 
having the property that (Az»,7)=9(2», 7) whenever 
0), then A» is the square of the reciprocal of 
the semiaxis of /,., whose direction is given by z 
If the coordinate system Is chosen so that the axes 
given by e@,, oc respectively, then A= 


(79, 2 


are 
satisfies the equation 


as was to be proved. 


Let us call the zeros of t/, ) the « ige nealues of A 
with respect to ro and the polynomial @,_,(A) the 
characteristic polynomial of A with respect lo? The 
rational function F(A) is accordingly the quotient of 
this polynomial and the characteristic polynomial of 
A. Hence we have, 

Theorem 18:1. The 


the eg-process of a mati id 


connected with 
A and a vector 2 


n umbers a, b 


can he com- 


puted by expanding into a continued fraction the 
tient built by the characteristic polynomial of A 
respect tor and the ordinary characteristic poly 
of A 

This is the simplest form of the relation bety 
matrix A, a vector ry and the numbers a,, b, « 
corresponding eg-process. The theorem may by 
to investigate the behavior of the a,, 6, if the « 
values of A and those with respect to ry are 
The following special case is worth recordin 
1, the rational function 
derivative of the characteristic 
18:1) follows 


iii hi Wm, 

logarithmic 

nomial. From theorem 
Theorem If the rector 7 


sum of the normalized eigenvectors of 


18:2 of a cg-proces 
A, the 1 
a,, 6, may be com puted by ¢ rpanding the loga 
of the 
continued fraction 
Finally, we are able to prove 
Theore m 18:3. There is no 


constants a . h ini the 


derivative characte , istic polynomial of A 


ction whate 


restr 
the positive cg-process, tl 
given two sequences of positive numbers do, a 
a and bh . h , b there ] 
definite A and a vector such that thi 
algorithm applied to A.? yield the given n umbers 

The demonstration goes along the following 
From (15:2) and (15:4), we compute the numbers 
d,, the ¢, being again positive. Then we use the 
tinued fraction (18:2) to compute F(A) which 
decompose into partial fractions to obtain 
We show next that the numbers \;, m,; appearing 
18:1) are positive. After this has been establish: 
our correspondence finishes the proof. 

In order to prove that A, >0, m,>0 we observe tl 
the ratio R R 


be seen from (18:3) 


is a symmetric p 


matris 


IS 


by induction 


, RO) build a Sturmian seq 
in the following sense. The number of zeros of R 
in any interval a< \<} is equal to the increase of 
number of variations in sign in going from a 
At the point \) there are no variations in sign si 
R(0O)=1 for every +. At dr ©, there are exact! 
variations because the coefficient of the highest pow 
of X in R#,(A) ts 1) ‘aga a Therefor 
roots A,, de, , A, of R,(d) are real and positiv 

That the function F(A) ts itself a decreasing fw 
tion of \ follows directly from (18:2). Therefor 
residues M,;, Moe, o oy , OTS positive 

In view of theorem 18:3 the numbers a, in a 
process can increase as fast as desired. This 
was used in section 8.2. Furthermore, the forn 


Rd), RA 


shows that there is no restriction at all on tl 
havior of the length of the residual vector during 
cg-process. Hence, there are certainly exa 
where the residual vectors increase in length du 
the computation, as was stated earlier. This 
in spite of the fact that the error vector h—z de 

in length at each step. 
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is a decreasing function of \, as ca 
Using this result 
it is not too difficult to show that the polynomials 


19. Numerical Illustrations 


number of numerical experiments have been 


with the processes described in the preceding 
ms. <A preliminary report on these experiments 


be given in this section. In carrying out these 
iments, no attempt was made 
1 favored the method Normally, we selected 
which might lead to difficulties 

carrying out these experiments three 
ilas for a;, 6, were used in the symmetric cas¢ 


to select those 


sets of 


iV, 


Ap 


we have used only the 


is 


19:4 


xperience thus far indicates that the best 
s are obtained by the use of (19:1 Formulas 

were about as good as (19:1 except for very 
litioned matrices. Most of our experiments 
carried out with the use of (19:2) because they 
omewhat simpler than (19:1 Formulas (19:3 
lesigned to improve the relations 


(0). p;,Ap (0. 19:5 
they did. Unfortunately, they disturbed the 
t of the relations 
Pa 0. } Ap 0 19:6 
ection of the geometrical interpretation of the 
“l will convince one that one should strive to 

the relations (19:6) rather than (19:5 It is 
s reason that (19:1) appears to be considerably 
or to (19:3 In place of (19:2 can 


rmulas 


one use 


rrect rounding off errors. A_ preliminary 
ment indicates that this choice is better than 


and is perhaps as good as (19:1 


A sufficient number of experiments have not been 
carried out as yet so as to determine the “best”’ 
formulas to be used Our experiments do indicate 
that floating operations should be used whenever 
possible We have that the 
in the (n+ 1)st and (n+2)nd iterations are normally 
far superior the nth iteration 

Exam ple | This example was selected to illus- 
trate the method of 
there are no rounding off 
was chosen to be the matrix 


also obs rved results 
to those obtained i 


gradients in case 
The matrix A 


conjugate 


errors 


the 


st p 


the vector 0.2 1.1 


If we select k to be 
computation is simple 
are given in table 1 
Normally, the computation is not as simple as in- 
dicated in the preceding case. For example, if 
selects the solution A to be the vector (1.1.1.1), then 
k is the vector (3,9,.5,6 The results with (0,0.0,0) 
as the initial estimate is given by table 2 


results at each 


one 





The system just described is particular] 
suited for elimination. In case & is the 
3.9.5.6) the procedure described in section 12 
the results given in table 3. In this table, w 
with the matrices A and J. These matri 
transformed into the matrices P*A and P* ¢ 
the bottom of the table 

It is of interest to compare the erro! 
y h x, obtained by the two methods just ce 
with k=(3, 9, 5,6). The error |y;| is given 
following table 


eg-method Elimination met} 


0. OO 


In the eg-method y,; decreases monotonically 
in the elimination method |y;| increases except 
the last step 

Exam ple 2. In this case the matrix A was « 
to be the matrix 


This matrix is a well-conditioned matrix, its ei 
values lying on the range A,;=.6035S5ASX 
The computations were carried out on an IBM 
card programmed calculator with about seven sig 
nificant figures. The results for the case in whi 
r, is the origin and A the vector (1,1,1,1,1 
given in table 4 

Exam ple 3 A good illustration of the effi 
rounding can be obtained by study of an ill- 
ditioned system of three equations with thre: 
knowns, namely, the system 


1324+-29y — 382 

17z—38y +502 
whose solution is z=1, y 3,2 2. Thesyster 
was constructed by E. Stiefel. The eigenvalues 


A are given by the set 


d, =.0588, d= .2007, hs = 84.7405 


The ratio of the largest to the smallest eigenval 
is very large: \;/A,=1441. The formulas (19:1 
(19:2), and (19:3) were used to compute the sol 
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0060068 


O000K 
KO00084 
OO 22 
100064 
WOole 


OO0008 


ng five significant figures at all times. For 
arison, the computation was carried out 
10 digits, using (19:2 

ble 5. In 
the better result In iteration, 
as (19:1) and (19:2) were equally good, and 
or to (19:3). The solution was also carried out 
elimination method using only five significant 

The results are 


also 


The results are given 
the third iteration, formula (19:1 


the fourth 


1. 0000000003 
WIGUGOUOY 7 


POUOGUUOG. 


eg-method (19:1) Elimination pen anette 
001882 00094441 

. 99424 1. 06603 

3. 00506 


2. 00180 


00000000002 
2. 99518 


1. 99328 





In this case the results by the cg-method and elimin- Several symmetric systems, some involvi 

The | many as twelve unknowns, have been solved 
1pm card programed calculator. In on 
where the ratio of the largest to the smallest 
a satisfactory solution has 


ation method appear to be equally effective. 
cg-method has the advantage that an improvement 
can be made by taking one additional step. 

This example is also a good illustration for the 
the size of the residuals is not a reliable 
to the solution. In 


value was 4.9, 
obtained already in the third step; in anothe 
where this ratio was 100, one had to carry out 
steps in order to get an estimate with six « 
although the estimate in case 1 is very far | digits. In these computations floating ope: 
case 3 we are | were not used. At all times an attempt was 
close to it. to keep six or seven significant figures 

Further examples. The largest system that has The eg-method has also been applied t 
been solved by the cg-method is a linear, symmetric | solution of small nonsymmetric systems o 
system of 106 difference equations. The computa- | swac. The results indicate that the met} 
tion was done on the Zuse relay-computer at the | very suitable for high speed machines 
Institute for Applied Mathematics in Zurich. The A report on these experiments is being pri 
estimate obtained in the 90th step was of sufficient | at the National Bureau of Standards, Los Ang 


fact that 
criterion for how close one 18 
step 3 the residuals in case 1 are smaller than those 
ot case 3, 
from the right solution, whereas in 


accuracy to be acceptable 


Los ANGELES, May 8, 1952 
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